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University of Toronto 
2000 

The main contribution of this work is the definition of a quantifier-free string theory Ti suitable 
for formalizing ALOGTIME reasoning. After describing Li — a new, simple, algebraic charac- 
terization of the complexity class ALOGTIME based on strings instead of numbers — the theory 
Ti is defined (based on Li), and a detailed formal development of Ti is given. 

Then, theorems of Ti are shown to translate into families of propositional tautologies that 
have uniform polysize Pregc proofs, Ti is shown to prove the soundness of a particular Prege 
system J^, and is shown to provably p-simulate any proof system whose soundness can be 
proved in Ti. Pinally, Ti is compared with other theories for ALOGTIME reasoning in the 
literature. 

To our knowledge, this is the first formal theory for ALOGTIME reasoning whose basic ob- 
jects are strings instead of numbers, and the first quantifier-free theory formalizing ALOGTIME 
reasoning in which a direct proof of the soundness of some Prege system has been given (in the 
case of first-order theories, such a proof was first given by Aral for his theory AID). Also, the 
polysize Prege proofs we give for the propositional translations of theorems of Ti are consider- 
ably simpler than those for other theories, and so is our proof of the soundness of a particular 
.?^-system in Ti. Together with the simplicity of Ti's recursion schemes, axioms, and rules these 
facts suggest that Ti is one of the most natural theories available for ALOGTIME reasoning. 
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Chapter 1 



Introduction 

The starting point for this work is the following open problem in complexity theory, concerning 
propositional proof systems (for a good introduction to propositional proof systems, including 
the basic definitions, see Cook and Reckhow [l9]). 

Open Problem 1 Are Frege ("T") and extended Frege (^'eT") proof systems p-equivalent? 

To provide some motivation for studying Open Problem [T] and to give an indication of its 
importance, note its connection to some major open questions in complexity theory through 
the following facts. 

General Fact 1 If NP ^ coNP, then P ^ NP. 
General Fact 2 NP = coNP if and only if TAUT e NP. 

General Fact 3 TAUT G NP if and only if there exists a super (i.e., polynomially- 
hounded) proof system for TAUT. 

General Fact 4 Given two proof systems fi and f2, if fi is super and f2 p-simulates fi, 
then f2 is also super. 

From Cook and Reckhow's paper, we know that p-simulation imposes a partial order on proof 
systems. Determining the relative position of particular proof systems in this order helps shed 
some light on their relative power and, because of General Fact HI on such major open problems 

7 7 

as NP = coNP or P = NP. From this point of view, determining the exact position of Frege 
systems relative to extended Frege systems in this order is one of the most important questions 
still open in this area. For the rest of this chapter, I will give a short survey of the major results 
and issues connected with Open Problem [TJ 
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Chapter 1. Introduction 



1.1 e^-systems and P 

It is traditional in complexity theory to equate "feasible" with "polynomial time". Moreover, 
there is a close association between polytime and e.7-"-systems since eJ-'-systems can be thought 
of as reasoning on uniform polysize circuits. For the rest of this work, I will use the traditional 
notation "P" when referring to the class of polytime decidable languages, and "FP" when 
referring to the class of polytime computable functions. Over the years, many characterizations 
of the classes P and FP have been given, most notably Cobham's "£" and Bellantoni and 
Cook's "fi". 

• Cobham's C |16j is the first machine-independent characterization of the class FP using 
a form of bounded recursion on notation. 

• Bellantoni and Cook's B [4] uses a tiered approach (i.e., it distinguishes between "safe" 
and "normal" parameters) in order to dispense with explicit bounds as in Cobham's 
scheme. 

Also, many logical theories have been proposed to capture polytime reasoning, most notably 
Cook's "PF", Buss's "S*!", Leivant's "PT(W)", and various second-order theories. 

• Cook's PV [181 I2Q] is a free-variable equational theory based on Cobham's C Cook 
showed that every formula t = u provable in gives rise to a family of propositional 
tautologies which assert the equation and have uniform polysize eJ^ proofs, that PV can 
define and prove the soundness of eJ-, and that if the soundness of a propositional proof 
system T is provable in PV , then p-simulates T. 

• Buss's S2 [7\ is a system of Bounded Arithmetic that can define exactly the polytime 
functions. Buss showed that S| is Sj-conservative over PV (when its language is suitably 
extended to include all the function symbols of PV), which implies that proves the 
soundness of eJ- and that the S^-theorems of 5*2 can be translated into propositional 
tautologies that have uniform polysize e.F-proofs (by the corresponding results for PV). 

• Leivant's PT(W) [26] has generative axioms for W (intuitively, binary strings) and in- 
stances of W-induction as its only axioms. It proves the convergence of exactly the 
polytime functions over W (when induction is restricted to positive existential formulas). 
This formalization is conceptually and technically very simple because it does not rely on 
any particular initial functions, other than the algebra's constructors (in fact, the theory 
can talk about any computable function). 

• Buss's Vi (studied by Razborov [27]) and Leivant's L2{QF^) are two of the most 
notable examples of second-order theories for P. 
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1.2 :F-systems and ALOGTIME 

The computational power of .F-systems seems to be captured by the uniform class iVC^, since J-- 
systems can be thought of as reasoning on polysize formulas, which are the same as logarithmic- 
depth circuits. Recall that NC^ is the class of languages decidable by families of logarithmic- 
depth circuits {FNC^ is the functional equivalent, using multi-output circuits), and by results 
of Ruzzo [28], C/s.-uniform NC^ = ALOGTIME, where ALOGTIME is the class of languages 
decidable in logarithmic time by a random access alternating Turing machine. The functional 
class FALOGTIME can be defined in two different ways: if functions are thought of as operating 
on integers in binary notation, we get a "numerical" version of the class, whereas if functions 
are thought of as operating on strings of bits (which is closer to the circuit model), we get a 
"string" version of the class. Fortunately, with a suitable interpretation of numbers as strings 
(or of strings as numbers), both versions are equivalent. 

Therefore, for the rest of this work, I will use ALOGTIME and NC'^ interchangeably, 
always referring to the uniform version of the class (unless otherwise specified). Also, I will 
use ''FALOGTIME" (or "FA'"C^") to refer to the functional version of the class. Various 
characterizations of FALOGTIME have been given over the years, most notably Clote's "A/q" 
and ''Nq" , and Bloch's string algebra. 

• Clote's A'o and A^q |12l [T3l [T3] are "numerical" characterizations that use restricted forms 
of Cobham's recursion on notation. Unfortunately, Nq includes a complete function for 
FALOGTIME as a base function, and Nq depends on Barrington's deep result about 
bounded- width branching programs so neither algebra is as natural for FNG^ as 
Cobham's C is for FP. 

• Bloch's algebra O [6] is a "string" characterization that uses the "safe" versus "normal" 
parameter idea together with a form of recursion similar to Allen's "divide and conquer 
recursion" (DCR) [IJ. Bloch recognized that Allen's scheme of DCR (which Allen used 
to characterize uniform NC) is particularly well-suited to characterizing uniform parallel 
complexity classes. Combining this with the tiered approach allows him to dispense with 
explicit bounds on the rate of growth of functions and to give an elegant characterization 
that uses only simple base functions and one natural scheme of recursion. 

Based on Bloch's ideas but incorporating some of Clote's, I will introduce in Chapter [2] a new 
simple string algebra Li that characterizes FALOGTIME using very few simple base functions 
and two simple schemes of recursion (CRN and TRN, to be defined there). It appears to us 
that Li is simpler than previous characterizations because it has fewer, simpler base functions, 
and no need for explicit bounds on the growth of functions or for different types of parameters. 
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Based on the characterizations of ALOGTIME given above, a number of theories to capture 
ALOGTIME reasoning have been defined, most notably Clote's "^iF" and ''ALV"\ Takeuti 
and Clote's ''TNC^'\ and Aral's ''AW (all of which are based on "numerical" characterizations 
of ALOGTIME). 

• Clote's ALV and ALV' [13^ [14] are free-variable equational theories based on his char- 
acterizations of ALOGTIME mentioned above and on Cook's PV . Clote showed that 
theorems of ALV and ALV' give rise to families of tautologies which have polysize J-- 
proofs, but did not show that either of his theories can prove the soundness of ^-systems. 
Also, the proof that the prepositional translations of theorems of ALV or ALV' have poly- 
size ^-proofs is fairly involved, and properties of even simple functions (such as "parity" 
or "majority") are difficult to prove. 

• Takeuti and Clote's TNC^ [l5] (first defined by Takeuti [29]) is a first-order theory similar 
to Buss's 5*2 that was shown to be conservative over ALV' (when suitably extended to 
include every function symbol of ALV'). Unfortunately, this theory needs to use a fairly 
complex form of inference called hounded successive nomination, because of its implicit 
dependence on Barrington's result (through Clote's characterization of ALOGTIME), 
which detracts greatly from its simplicity. 

• Aral's AID [2] is a system of bounded arithmetic inspired by Buss's consistency proof 
for ^-systems |9j, which proves the soundness of J- and whose Sg-theorems have polysize 
J^-proofs when suitably translated. Moreover, Aral shows that AID is equivalent to a 
quantified version of Clote's ALV , and hence that ALV can prove the soundness of J-. 

Unlike the situation for P, there is no quantifier-free theory for ALOGTIME which has the sim- 
plicity and naturalness oi PV . I claim that Ti fills that role, its axioms and induction schemes 
being based directly on Li's simple base functions and natural recursion operations. Moreover, 
the proofs that propositional translations of the theorems of Ti have uniform polysize .F-proofs 
and that Ti can prove the soundness of ^-systems are much simpler than the corresponding 
proofs for other theories in the literature. 

1.3 Overview 

Now that I have provided some context and motivation for studying Open Problem [H let me 
give a brief overview of the rest of the thesis. In Chapter 2, I will introduce the string algebra 
Li, followed in Chapter 3 by the quantifier-free theory Ti (including a formal development of 
the theory, showing how to prove the pigeonhole principle in Ti). In Chapter 4, I will define 
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prepositional translations for theorems of Ti and show that they have polysize .F-proofs, while 
in Chapter 5, I will show that Ti proves the soundness of J^, by formalizing an algorithm for 
the "Boolean Sentence Value Problem" (BSVP) in Ti, and that provably p-simulates any 
proof system whose soundness can be proved in Ti. Finally, in Chapter 6, I will compare Ti 
with various other formalisms for ALOGTIME reasoning, most notably Aral's AID. 



Chapter 2 

The String Algebra Li 



In this chapter, we define Li and show that it contains exactly the functions in FALOGTIME. 
We also give many examples of natural Li definitions for simple FALOGTIME functions. 

2.1 Basic definitions 

The basic objects of the algebra are strings over the alphabet {0, 1}. The set of all such strings 
can be defined inductively: e (the empty string), 0, 1 are strings, and if x and y are strings, 
then so is xy. Together with a wish for simplicity, this inductive definition motivates our choice 
of base functions. 

The reader should keep in mind that our definitions in this chapter are based on, and guided 
by, the idea of computation by uniform families of circuits. In particular, all our functions will be 
length- determined, i.e., the length of a function depends only on the lengths of the arguments, 
not their values. Also, the starting point for our algebra Li is Bloch's paper [6], where he 
carries out a similar function-algebraic characterization of FALOGTIME, so we will borrow 
many concepts and definitions from there. (We also borrow certain concepts and definitions 
from Clote's work pj [H [Ml.) 

Now, we define the base functions and the basic operators that we will use to construct new 
functions. We use to denote the length of x (i.e., the number of symbols (bits) in the string 
x), Xk to denote a /c-tuple of variables, and x to denote an arbitrary tuple of variables. 

BASE: The set of base functions consists of (in order of increasing arity): 

e, 0, 1 = empty string, 0-bit, and 1-bit (constants), 

>-x = the [|x|/2] rightmost bits of x ("right half"), 
X <\y = X with \y\ bits removed from the right ("right chop"), 
X ■ y = X followed by y ( "concatenation" ) , 
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y if X = e, 

x7 {y,zo,zi) = < zo if X = w • for some w, ("conditional") 
zi if X = w ■ 1 for some w, 
X^(xi, . . . ,Xn) = Xk for any 1 < k < n ("identity" or "projection" 



Remark 2.1.1 In the definition of x ? (y, zg, zi), it is assumed that l^oj = \zi\. If that 
is not the case, then the value returned will be padded on the left with as many O's as 
are necessary to make zq and zi the same length (the length of y does not change) . 

COMP: / is defined from g and hi, . . . ,hk by composition if 

/(f) = g{hi{x),...,hk{x)). 

CRN: / is defined from h by concatenation recursion on notation on x if h(x, y) S {0, 1} for 
all x,y and 

/(e,y) = £, 
f{xi,y) = f{x,y)-h{xi,y) for i = 0, 1. 

TRN: / is defined from g, h, hg, and hr by tree recursion on notation on x if 
f{x,z,y) -- 



g{x,z,y) ifx = e,0, 1, 

h{x, z, y, f{x<, he{z),y), /(►x, hr{z),y)) otherwise, 



where x< = x < ►x (the [|x[/2j leftmost bits of x). In what follows, we will omit the 
parameter z when neither g nor h depend on it (in which case the functions h£ and hr are 
irrelevant and will not be specified); we will refer to this form of TRN as simple TRN. 

Remark 2.1.2 Our "right half" function was called "back half (i?/i)" by Allen [1] and 
Bloch [6]. We introduce the new nomenclature because we feel that it is more representa- 
tive of the action of the function, and the new notation to serve as a graphical reminder of that 
action (picture the black triangle cutting into the left part of x). Similarly, our "right chop" 
function was called "chop" by Cook [20] and "most significant part {Msp)" by Allen and Bloch. 
Our new notation should serve as a useful graphical mnemonic for the function's purpose and 
action (picture the bits of y cutting into the bits of x from the right — in the direction pointed 
to by the function symbol) . Our scheme of CRN is based on the operation of the same name in 
Clote's work [12l[T3l[ll], except that our version has been simplified by eliminating the function 
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g from the base case (without loss of generahty since we can simply concatenate g{y) to the 
left of our functions to get Clote's). Our scheme of TRN is based on Bloch's "very safe DCR", 
which is itself based on Allen's "DCR" (for "divide-and-conquer recursion"), except that our 
base case is simpler (defined for x = £,0,1 instead of when |x| < |6| for some extra parameter 
b) , and we have added the functions and hr that allow parameter z to vary during recursive 
calls (hence, TRN is technically a scheme of "recursion with replacement"). 

Definition 2.1.1 If we let TRN|^, represent the operation of TRN restricted to functions 
g E L and h, h(,,hr G L' , for function classes L and V , then 

• Lo is the closure of BASE under COMP and CRN; 

• Li is the closure of Lq under COMP, CRN, and TRN|^^, defined recursively. 

The next few sections contain mainly function definitions, where the following notational 
conventions will be used. 

• For any constant string c, Cfc represents the tuple consisting of k copies of c. 

• Unary functions have higher precedence than binary functions and binary functions have 
higher precedence than functions of higher arity (keep in mind that "?" has arity 4). 
Concatenation has higher precedence than any other binary function when represented 
by juxtaposition; it has lower precedence than any other binary function when represented 
by "•". 

• i and j represent arbitrary fixed single bits, whereas k, I, m, and n represent arbitrary 
fixed non-zero natural numbers. When 2*^ is used, k ranges over all natural numbers 
(including zero), and similarly for 2^. 

k 

• The notation k x x stands for x"^*^^ [i.e., x concatenated with itself k times). We let 
X X = £ and use A; as an abbreviation for A; x 1, i.e., the unary string representing k. 

2.2 Functions in Lq 

In this section, we define many functions in Lq and show that many useful generalizations of 
CRN can be simulated in Lq. We are motivated by two goals: to define the machinery necessary 
to prove that Li contains all of FNC^, and to show that many useful functions have simple 
definitions in our algebra. 
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2.2.1 Basic functions 

First, we define a few simple variations on some of the BASE functions. Tlie rightmost bit of 
x: x' = x7 (e, 0, 1) ; x with its rightmost bit removed: x< = x <] 1 ; the [|x|/2j leftmost bits of 
x: x-^ = X < ►.X. 

Next, a function that reverses the bits of x can be defined by first using CRN to define a 
function reverse(x, y), which returns the |y| rightmost bits of x reversed: 

reverse(a;,£) = e, 

reverse(x, yi) = reverse(x, y) ■ {x < y)' for z = 0, 1. 

Then, rev(a;) = reverse(a;, x) returns the reverse of x. Using this function, we can now define 
symmetric counterparts to some of the earlier functions: 

y> X = rev(rev(x) <\ rev(?/)), >x = rev(rev(x)<), V = rev(x)'. 

Now, let us introduce a generalization of CRN: a function / is defined from h by left CRN 
(or reverse CRN) on x if h{x, y) G {0, 1} for all x, y and 

f{e,y) = e, 

f{ix, y) = h{ix, y) ■ f{x, y) for i = 0, 1. 
If / is defined from h by left CRN on x, then we can use CRN to define 
aux_/(e,y) = e, 

aux_/(a;z, y) = aux_f{x, y) ■ h{re\/{xi),y) for i = 0,1, 

and f{x,y) = rev(aux_/(rev(x), y)), using COMP. In what follows, we will use the notational 
conventions outlined before this section and we will no longer include the trivial base case 
/(e, y) = e OT write "for i = 0, 1" when using CRN to define new functions. 

2.2.2 String manipulation functions 

Now, we will define useful functions for manipulating strings. First, two simple functions that 
returns a string of the same length as its input, but consisting entirely of O's or entirely of I's: 

j{xi)=jx-j (by CRN). 

Next, we can define a number of functions to compare the lengths of strings (these function 
symbols will be distinguished by putting a superscript next to them). First, it is useful 
to have a conditional that tests for the length of a string: x (y, z) = x 7 {y, z, z) is equal 
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to y ii X is empty and equal to z otherwise. Because distinguishes only between "empty" 
and "non-empty", we will define the "length-relational" functions below so that they return 
£ when the relation holds and some fixed non-empty string (like "1") otherwise. Accordingly, 
we define a simple signum function that returns e if its argument is empty and 1 otherwise: 
pa^x = X (e, 1) and a corresponding "negation": -i^x = x Now, we are ready to 

define the comparison functions. 

x>^ y = ^^{x >y) X <^ y = ^^{x < y) x =^ y = \> y) ■ {x < y)) 

x>^ y = <^ y) x <^ y = -^^{x >^ y) x y = -.^(x =^ y) 
max^(x, y) = {x y) (x, y) maxi(x) = x 
max^+i(x,Xjfc) = max^(x,max^(xjk)) 

We can also define functions to manipulate the lengths of strings, namely d'w^kix) that returns 
a string of I's whose length is [1x1/2*^] and a corresponding mod2fc(a:;) function satisfying ix = 
2*^ X div2fc(a;) • mod2fc(a;). 

d\y[{x) = ix div^fc(x) = div^(x^) 
mod2fc(a;) = (2*^ x div2fc(x)) > ix 

(Interestingly, there docs not seem to be a way to define a div^ function for arbitrary k without 
using TRN.) Following this, wc define functions to perform simple bit manipulations on strings 
(extract single bits or substrings, pad to a certain length). 

• "Left bit": \b{x,y) = y {e,^{>y > x)) returns bit number \y\ of x from the left; "right 
bit": rb(x, y) = y?^^(e, (x<l?/<)') returns bit number \y\ of x from the right (both are equal 
to e if y = e or \y\ > jx|). For convenience, we also define lb^(x,y) = \b{x,y) ? (0,0,1) 
and rb^{x, y) = rb(x, y) ? (0, 0, 1) which return or 1 for all arguments. 

• "Left cut" : lc(x, y) = x<\{y\>x) returns the \y\ leftmost bits of x; "right cut" : rc(x, y) = 
(x < y) > X returns the \y\ rightmost bits of x (both return e ii y = e and x if \y\ > |x|). 

• "Left pad" : lpj(x, y) = j{y<ix) -x returns x padded on the left with fs so that |lpj(x, y)\ > 
\y\; "right pad": rpj{x,y) = x ■ j{x \> y) returns x padded on the right with j's so that 
|rpj(x,2/)| > \y\ (both return x if \y\ < \x\). 

• "Left adjust": laj(x,?/) = j{y <\ x) ■ ((x <]y) [> x) returns x either chopped or padded on 
the left so that \\aj{x,y)\ = \y\; "right adjust": raj(x,y) = (x < {y > x)) ■ j{x t> y) returns 
X either chopped or padded on the right so that |raj(x,y)| = 



12 



Chapter 2. The String Algebra Li 



Finally, we have all the functions we need to define a tuple function {{xk)f,) and corresponding 
projection functions (7r^(x)). To form tuples, we simply concatenate the arguments together 
after padding them on the left so that they all have the same length. The projection functions 
are then defined easily using < and ►. One small complication arises because we can only divide 
the length of a string by a power of 2, so we need to form tuples that always have a power of 
2 elements even when there are fewer of them that are actually input values. The definitions 
follow and are inspired by similar definitions in Bloch's paper [B]. (The tuple function is defined 
in terms of an auxiliary function tuple that has an extra parameter specifying the length to 
which each value should be padded.) 

tup\ei{x,z) = lpo(x,2:) 
tuple2fc(ffc, yfc, z) = tuplefc(ffc, z) ■ tuplefc(yfc, z) 
tuple2fc+i(xfc, yfc+i, z) = tuple^+i(e, Xk, z) ■ tuplefc_^i(yfc+i, z) 
{xk)k = tuplefc(xfc, vnaxKxk)) 
T^iiy) = y 

jl-ki^y) if^>^ 

TT^.tiiy^) if^<^ 



-rHy) 



Note that these functions satisfy the following relations: 



Tri{{xi,...,Xk)k) = lpo(x£,maXfc(xfe)), 
^7rf (y),...,7rj(y)^^^ =y 



(unfortunately, (vrf (y), . . . , 7r^(y))^ 7^ y for arbitrary k because of the way the tuple function is 
defined) . 



2.2.3 Generalizations of CRN 

We now introduce a generalization of CRN where the recursion is defined on several variables 
at once. (We assume that xi, . . . , Xm all have the same length, or are appropriately padded on 
the left with O's to make them all the same length.) 

Definition 2.2.1 (CRN^) We say that / is defined from h by CRNm on xi,...,Xm if 
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h{xm,y) e {0, 1} for all Xm,y and 

f{£m,y) = e, 

f{xiii, Xmim, y) = f{xi, Xm, y) ' h{xiii, Xmim, v) for ii = 0, 1; . . . ; C = 0, 1. 

(We can also define left CRN^ similarly to left CRN.) If / is defined from h by CRN^ on 
Xi,... ,Xm, then we can define / using CRN as follows: We will define an auxiliary function 
aux_/ by CRN on a parameter z; this function will mimic the recursion on xi, . . . , Xm by using 
Ic to extract the correct substrings of xi,... ,Xm based on the length of z. Then, / is easily 
defined from aux_/ by COMP. 

aux_f{zi,Xm,y) = auxJ'{z,Xm,y) ■ h{\c{xi,zi),. . . ,\c{xm, zi),y) 

f{xm,y) = aux_/(max^(f^),lpo(xi,max^(f^)),... ,lpo(x^,max^(fm)),y) 

When a function f{x,y) is defined by CRN on x from h, every bit in the output corresponds 
to one bit from x. Now, we will show how to define a function where every bit of x corresponds 
to two bits in the output, and then generalize this to arbitrary values (where every group 
of 2^^ bits in the input corresponds to a group of 2" bits in the output, which we will call 
"2'=-to-2"-CRN", or "(2^=, 2")-CRN"). 

Following the notation mentioned above, we say that a function / is defined from h by 
l-to-2-CRN ({1,2)-CRN) on x if \h{x,y)\ = 2 for all x,y and 

f{e,y) = e, 
f{xi,y) = f{x,y)-h{xi,y) for i = 0, 1. 

(We can also define left (1,2)-CRN.) If f{x,y) is defined from h by (1,2)-CRN on x, we 
can define / using CRN as follows: We will first define an auxiliary function aux_f{z,x,y) by 
CRN on z, to return the \z\ leftmost bits of f{x,y) and then define / from aux_/ by COMP. 
Intuitively, aux_/(z,x, y) uses div2(2;) to determine which bits of x to give as input to h and 
mod2(2;) to determine which bit of h to output next. 

aux_f{zi,x,y) = aux_f{z,x,y) ■ lb''(/i(lc(x, div2(2;) • i) , y) , mod2{z) ■ i) 
f{x,y) = aux_/(x •x,x,y) 

Now, we can introduce the generalization mentioned above. 

Definition 2.2.2 ((2^^, 2")-CRN) We say that / is defined from g and h by 2^-to-T'-CRN 
f(2^,2")-CiZA9 on x if \h{x,y)\ = 2" for all x,y and 

fix,y)=g{x,y) if |x| < 2^ 

/(x ■ z,y) = /(x, y) ■ h{x ■ z, y) for z G {0, 1}^'' . 
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(As before, we can also define left (2'=, 2")-CRN.) If / is defined from g and h by {2^, 2")-CRN 
on X, then we can define / using CRN as follows. (The intuition is similar to that for (1, 2)-CRN 

y) = aux_f{z,x,y) ■ lb^(^/i(lc(x, mod^fe(x) • 2^= x (div^„(z) • i)),y), mod^„(z) • 
f{x,y) = g{\c{x, mod^k{x)),y) ■ aux_/(2'' x d\v^k{x), x,y) 

By combining the two generalizations above, we can show that any function defined by 
"(2^2")-CRN can be defined using CRN and COMP alone, which gives us a relatively 
powerful way to define many more useful functions. 

2.2.4 Boolean functions 

The next functions we will introduce are the Boolean operators, i.e., the standard connectives 
together with some useful functions for comparing bits (these function symbols will be distin- 
guished by putting a superscript next to them). First, we will define a "Boolean test" 
function, which tests for the truth-value of its argument (where a string's truth-value is deter- 
mined by its rightmost bit by convention, with 1 = true and = false — e is treated the 
same way as 0) : x ?^ (y , z) = x ? (z, z, y) is equal to y if x is "true" ; z if x is "false" (according to 
the convention above). Then, ~^x = x ?^ (1)0) returns the truth-value of x and we can define 
the boolean connectives in the usual way. 

-«x = x?«(0,l) X y = X (f««y, 0) x y = x ?^ (1, ««2/) 

^^By^^.B ^^By^ 1) ^^By^^.B ^^B^^ ^B^^ ^^By^^.B ^^By^ ^B^^ 

X >^ y = y -^^ X X <^ y = X y x =^ y = x y 

X <^ y = -.^(x >^ y) x>^ y = -.^(x <^ y) x y = -.^(x =^ y) 

Using CRNm, we can now easily define the following useful functions that perform bitwise 
operations on their arguments. 

not^(xi) = not^(x) • -^i 
andf (xiii, . . . , x^ifc) = andf (xi, . . . , x^) • (n A"" • • • i^) 

orf (xiii, . . . , Xfc4) = orf (xi, . . . , x^) • (zi V"" • • • V"" 4) 
xorf (xiii, . . . , Xkik) = xorfc(xi, . . . , x^) • (n 0^ ■ • • 0^ ik) 

ifff (xiii, . . . , Xkik) = ifffc(2;i, . . . , Xfc) • ((ii i2) A^ • • • A^ {ik-i ik)) 

And following Buss [8j, we can define functions that implement carry-save addition: CScar to 
compute the carry bits and CSadd to compute the addition bits. Note that these functions are 
defined so that CScar(x, y, z, w) + CSadd(x, y,z,w) = x + y + z + w (a fact that will be proved 
rigorously in Chapter [3l) 
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CScars{iiXi,i2X2,hx^) = ((ii A'' 12) {'h is) (is h)) ■ CScar3(xi,a;2,X3) 
CSadd3(xi,X2,X3) = xorf (Oxi, 0x2, OX3) = • xor3(a;i, 0:2, X3) 
CScar(xi, X2, X3, X4) = CScar3(CScar3(a;i, a;2, X3) • 0, CSadd3(xi, X2, ^3), OX4) • 
CSadd(a;i, 2:2, X3, 0:4) = CSadd3(CScar3(a;i, X2, X3) • 0, CSadd3(a;i,X2,X3),0x4) 

2.3 Functions in Li 

In this section, we define many functions in Li and show that some useful generalizations of 
TRN can be simulated in Li. Again, we are motivated by two goals: to define the machinery 
necessary to prove that Li contains all of FNC^, and to show that many useful functions have 
simple definitions in our algebra. 

2.3.1 Basic functions 

Recall that the operation of TRN is restricted in Li so that we cannot define a function by 
TRN from functions that are themselves defined by TRN. Hence, it will be useful to be able 
to define more than one function simultaneously by TRN. 

Definition 2.3.1 (TRN^) The functions fi {I < i < k) are defined from functions gi, 
hi, hi, and hr by TRN^ on x if \fi{x, z,y)\ = ■■■ = \fi^{x,z,y)\ for all x,z,y, and for every 
l<i<k. 



fi{x,z,y) 



9i{x,z,y) ifx = e,0, 1, 

hi{x,z, fi{x<,hi{z),y), fi{>'X,hr{z),y), . . . , 

fk{x<, he{z), y), fk{>-x, hr{z),y),y) otherwise. 



If fk are defined from g^, hk, hi, and hr by TRN^, we can define the fe-tuple F{x,z,y) 
{fk{x,z,y))j^ by TRN as follows. 



F{x,z,y) 



{9k{x,z,y))f^ ifx = £,0, 1, 

hk{x,z,7r'l(Fix<,heiz),y)),7T'l(F{P'X,hr(z),y)), 

■kI.{F{xM, hi{z),y)), Tr^{F{^x, hr{z), y)),y) otherwise. 



Then, a simple composition gives fi{x, z, y) = vrf z, y)) for 1 < i < fe. 

Now, we can define some functions by TRN (actually, by simple TRN). The first two perform 
Boolean operations on all the bits of their input; |.x'| returns the length of x, expressed as a 
binary number; returns |x| copies of y concatenated together (so that = |x| x \y\); 
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cliv^(a;) returns a string whose length is equal to [|a;[//i;J (where we use the notation "mod^,(3;)" 
as a shorthand for the term (k x div^(x)) > ix); and the last two functions are defined by TRN2 
and will be used to count the number of I's in the string x, using the carry-save technique of 
Buss [8] (this will be done below). 



X if X = e, 0, 1 

AND(x-^) A^ AND(^x) otherwise 

X if X = e, 0, 1 

OR(x<^) V'' OR(^x) otherwise 

ix if X = e, 0, 1 

x< > ►x • 0, |x-^| • 1) otherwise 

X (e, y) if X = e, 0, 1 

(x-^ # y) ■ (►x ^ y) otherwise 

X [> /c (ix, e) ifx = e,0, 1 

div^(x-<) • div^(^x) • ((mod^,(x-<) • mod^(^x)) > A;?^^ (l,e)) otherwise 

ox if X = e, 0, 1 

CScar(CAR(x-^), CAR(^x), ADD(x-^), ADD(^x)) otherwise 

X if X = e, 0, 1 

CSadd(CAR(x^), CAR(^x), ADD(x-^), ADD(^x)) otherwise 




x#y 



div^(x) 
CAR(x) 
ADD(x) 



Note that |CAR(x)[ = |ADD(x)[ (easy to show inductively) so the definition by TRN2 is correct. 
Also note that with divj^ and modj^, we can now define "(A:,£)-CRN" similarly to (2^2^)-CRN, 
but for blocks of bits of arbitrary fixed lengths. Interestingly, it does not seem possible to define 
a more general div^ function that would take two parameters, and thus to define a general form of 
CRN where the lengths of the input and output blocks of bits are specified by extra parameters 
(we discuss this issue further in Chapter [3]). 



2.3.2 Numerical functions 

Unfortunately, the fact that functions in Li are length-determined makes it harder to define 
"numerical" functions, i.e., functions that treat their inputs as binary notation for numbers 
(ignoring leading O's). For example, the definition of |x| given above is quite simple whereas 
the definition of Ixj'' given below relies on some more complex functions. 
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Now, we will define a number of "numerical" functions (distinguished by putting a super- 
script next to them). We start with an equality operator for numbers, and also one for 
strings. 

X =^ y = AND(1 • ifFf (x, y)) x =^ y = ^^{x =^ y) {x =~ y) 

Note that the extra "1" is necessary in the definition of for e e to be true, and the 
value of x =^ y is independent of the lengths of x and y, i.e., the function really does behave 
as though its string inputs were binary representations of numbers. Also note that is 
necessary in the definition of =® because of our convention that =^ returns e for "true" and 1 
for "false". 

Next, we define a successor and a predecessor function, both by left CRN. The successor 
function is defined in terms of an auxiliary function that simply replaces each bit by its negation 
until it encounters a 0, which it replaces by 1, and then outputs each bit unchanged {e.g., 11010 
becomes 11011, 1011 becomes 1100, 111 becomes 000). The successor function first adds a to 
the front (left) of its argument before calling the auxiliary function, in case the string consists of 
all I's {e.g.. Ill correctly becomes 1000 and not just 000). The predecessor function performs 
a similar computation, except replacing bits by their negation until it encounters a 1, and first 
checking that the string does not consist of all O's before calling the auxiliary function. 

aux_succ'"'(ia;) = (AND(lx) ?^ {-i^i,i)) ■ aux_succ''^'(x) succ^(x) = aux_succ''^(Oa;) 
aux_pred~(ix) = (0R(2;) ?^ {i, ^^i)) ■ aux_pred~(x) pred^» = OR(x) 7" (aux_pred~(x), x) 

Note that one unfortunate side-effect of the fact that the functions are length-determined is that 
the successor function always appends a bit to the left of its argument. So starting from e and 
applying succ^ repeatedly, we get a series of strings that represent 0, 1, 2, . . . in binary, but whose 
lengths are also 0,1,2,... Next, we define the numerical predicate <'^, which together with =^ 
allows us to define all other relational operators on numbers using the Boolean connectives. 
Note that to define less'', we use Clote's "programming trick" [12j of making a sweep through 
the bits of the strings x and y, appending a 1 when some condition is met so that the final 
composition with OR yields 1 iff the condition was met at some position. Using AND, we could 
similarly define functions that test for some condition on every bit of their inputs. Also recall 
that functions defined by CRN2 (such as less'' below) first pad their arguments on the left with 
O's so they have the same length. 

Iess~(x^, yj) = less~(x, y) ■ {{i j) A« (x =^ y)) x <^ y = OR(less~(x, y)) 

The next function we want to define is bit''(x, z), which returns bit number z of x, starting 
at 1 and counting from the right, where z is interpreted as a binary number. The easiest way to 
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do this is by defining a function pow~(2;, x) that returns a string of length |x| consisting entirely 
of O's except at bit position z (from the right), if 1 <~ z <~ Then, we define a function 
maskbit~(x,y) that treats y as a mask to determine which bit of x to return. 

pow'^ {z,ix) = {jix\ =^ z) ■ po\N^{z,x) 

maskbit~(a;, y) = OR(andf (a;, y)) 
h\t^{x,z) = maskbit~(x, pow~(z,x)) 

Next, we want to define addition. This will require only a few more definitions. First, in order 
to simulate a function that strips leading ones (or zeros) from x, we can define functions firstj(x) 
that return a mask which is 1 on the leftmost bit of x equal to j and elsewhere. We can also 
define a function that returns a mask which is 1 on every significant bit of x (i.e., every bit to 
the right of the first "1" in x) and elsewhere. 

firsti(xi) = firsti(x) • (OR(x) ?^ (0,i)) firsto(a;i) = firsto(x) • (AND(lx) ?^ (-^^^,0)) 
mask^(xi) = mask~(x) • (OR(x) ?^ (l,i)) 

Then, we can define a function which computes the carry bits and an addition function, as 
follows. 

carry~(zx, jy) = maskbit'^(andf(z2:, jy),firsto(xorf(zx, jy))) •carry~(x,y) 
x +~ y = xorf (carry~(x, y) ■ 0, x, y) 

Finally, using the addition function, we can define a function that counts the number of ones in 
a string: sum(a;) = CAR(x) +''ADD(x) and using this function, define the "numerical" length 
of x: \x\^ = sum(mask^(a;)). (Note that we could also have defined \x\ = sum(ia;) instead of 
directly using TRN.) 

2.4 Li and FALOGTIME 

In this section, we prove the following claim. 

Claim 2.4.1 Li = FALOGTIME (= uniform FNC^). 

To be precise, we say that a A;-ary function / belongs to FALOGTIME if there exists an 
integer polynomial pf such that \f{xk)\ < pf{\xi\,...,\xk\) for every Xk, and if the lan- 
guage {{xk,i, b) : the i-th bit of /(x^) is equal to b} is recognizable by an ATM running in time 
C'(max{log|xi|,...,log|xfc|}). 
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2.4.1 FALOGTIME is contained in Li 

To prove that FALOGTIME C Li, we show how to simulate the computation of an ATM 
using functions in Li. Then, if / € FALOGTIME, there exists a term tf & Li such that 
\ifixk)\ = Pfi\xk\), so we can use CRN on tf{xk) to compute each bit of / by simulating the 
ATM on the appropriate input. (Technically speaking, this works only if \f{xk)\ = \tfixk)\ for 
all Xk, but if that is not the case, we can simply use Bloch's idea of "length masks" to compute 
a mask bf{xk) of length tf{xk) that has a 1 in every bit position where f{xk) is defined and a 
elsewhere.) 

Now, without loss of generality, let the ATM have the following properties. 

1. There is a function t G Li such that the ATM runs for no more than = ©(log |i(x)|) 
steps on inputs x (always possible when the ATM runs in logarithmic time since t can use 
# and • to output a string whose length is an arbitrary polynomial in the lengths of the 
inputs). Also, universal states of the ATM are given even numbers and existential states, 
odd numbers. Moreover, we assume that the function t is defined so that \t{x)\ is always 
a power of 2 and 2^1*^^^' is greater than the number of states of the ATM for any inputs x 
(in other words, a string of length 2|t(a;)| is long enough to encode the state of the ATM). 

2. The ATM has n read-only input tapes represented by strings xi,... ,Xn and k worktapes 
represented by pairs of strings , y[, . . . , y^, yj^, each of length exactly 2|i(x)|, where yf 
represents the content of tape number i to the left of the tape head and y[ represents 
the content to the right, with the head scanning the rightmost symbol of yf. Each of the 
three possible worktape symbols (1, 0, or blank) is encoded using two bits (11 for 1, 10 
for 0, and 00 for blank). Initially, the worktapes are blank. 

3. The computation tree of the ATM is a complete binary tree (each non-leaf node has exactly 
two successor configurations, a left successor and a right successor, and every leaf occurs 
at the same level). 

4. Access to the input occurs only at the leaves of the computation tree and is of the form 
"accept iff symbol number yf (interpreted as a binary number) on input tape number j 
is equal to 6", where i, j, and b are encoded in the current state of the ATM. 

Then, if we let CON = (s, 2/1,2/1, • • • > 2/^5 2/fc)2fc+i represent a configuration of the ATM when in 
state s, we can define ZCON and rCON, the left and right successor configurations of CON, as 
follows: 




20 



Chapter 2. The String Algebra Li 



where each of (state, Itape^, Itape^, rstate, rtape^, rtape[ is easily seen to be in Lq, involving only 
simple string manipulations and finite table lookup on the state s (for example, Itapef(cON) = 
di 7" (00 > ((yf < 00 • bi) ■ lc(y[,00)), (00 • yf) < OO) computes the contents of tape i to the 
left of the head in the left successor of CON, where di (the direction of movement for head i) 
and bi (the tape symbol to write on tape i) are obtained from the state and tape contents of 
CON using the conditional function). Moreover, if we let select be defined by (2, 1)-CRN to 
output every second bit of its input string, the function input(cON, x) = h\t'^{xj,se\ect{yf)) 
(where i, j, and b are extracted from the state s) is equal to the accept state of the given input 
configuration and is in Li. Finally, we let CONq = {sq, t{x) # 00, . . . , t{x) # ^0)2^+1 denote the 
initial configuration, where sq is the initial state of the ATM. 

Now, we can easily use TRN to define a function aval that evaluates the computation tree 
of the ATM, so that the result of the entire computation is given by eval(t(5f), CONq, x): 



eval(2:, CON, x) 



input(cON,x) if 2: = £,0,1, 

7r^''+^(cON)' ?^ (eval(2:-^, teON, x) eval(^2;, rCON, x), 

eval(2:-^, teON, x) eval(^2;, rCON, x)) otherwise. 



Note that in the recursive call, 7r^'^'^"^(cON) simply extracts the current state from the given 
configuration, and the rightmost bit of the state number is used to determine whether the con- 
figuration is universal or existential. Also note that this half of the proof is considerably simpler 
than the corresponding proofs in Bloch [6j and Clote \13\ I14j . This seems to be because our 
scheme of TRN encapsulates the sort of computation carried out by ATM's more directly than 
the schemes considered by Bloch and Clote, especially by its use of the parameter replacement 
functions /i£ and hr- 

2.4.2 Xi is contained in FALOGTIME 

To prove that Li C FALOGTIME^ we argue that every function in Lq can be computed by a 
family of circuits in uniform FNC^, and that every function in Li can be computed by a family 
of circuits in uniform FNC^, where we use Bloch's notion of mapping-uniformity, defined in 
which generalizes C/^;* -uniformity to make sense for circuits of constant depth. As will be 
seen, the facts that functions in Lq have constant depth circuits and that functions in Li have 
logarithmic depth circuits are quite simple to prove; the technical difficulties arise mainly from 
uniformity considerations. 

First, we give bounds on the rate of growth of functions in Lq and Li. 

Lemma 2.4.2 For every n-Bry function f €E Lq, there exist constsLnts clq, a,^, . . . , fln G N such 
that 1/(2:1, • • • ,Xn)\ < clq + a{|xi| + ■ ■ ■ + an\xn\ for all strings Xi, . . . ,Xn- 
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Proof The result is proved by induction on the definition of /. 
f 

• If / = e, then % = since |e| = 0. 

• If / = or / = 1, then = 1 since |0| = |1| = 1. 

• If f{x) = ►x, then Oq = 0, a{ = 1 since \>-x\ = \\x\/2] < \x\. 

• If f{x, y) = X < y, then Oq = 0, a( = 1, = since \x <iy\ = \x\ — \y\ < \x\. 

• If f{x,y) = X ■ y, then Oq = 0, a{ = 1,03 = 1 since |x • y| = \x\ + \y\. 

• If f{x, y, z, w) = x7 {y, z, w), then Cq = 0, a{ = 0, 05 = 1, ag = 1, 04 = 1 since 
\x ? < ma.x{\y\, \z\, \w\} < \y\ + \z\ + 

• If f{xi, ...,Xn)= 1'^{xi, . . . ,Xn), then = 0, . . . ,a{_^ = 0,a{ = l,a{^^ = 0, . . . = 
since . . . ,x„)| = \xk\. 

• If / is defined by CRN from h, then y)\ = \x\ so = 0, a{ = 1, 03 = 0, . . . , a{^^ = 0. 

• If / is defined by COMP from g and hi,. . . ,hk, then 

|/(f)| = |5(/ii(x),...,%(x))| 

<ag + a?|/ii(x)| + --- + af|%(f)| 

l<i<fc l<j<n 
l<j<fe l<j<n l<j<fe 

SO 4 = ag + afa^i + • • • + a^ao^ and a{ = afaf^ + • • • + a^af*' for 1 < z < n. □ 

Lemma 2.4.3 For every n-ary function f £ Li, there exists a polynomial pf G N[xi, . . . , x„] 
sucii that \f{xi, . . . ,Xn)\ < . . . , |a;„|) for all strings Xi, . . . 

Proof The result is proved by induction on the definition of /, where we use the notation 
\xn\ to stand for the list |a;i|, . . . , |a;„|. 

• If / G Lq, then by the preceding lemma, pf{\xn\) = + ci{|a;i| + • • • + an\xn\- 

• If / is defined by CRN from h, then as in the preceding lemma, p/(|a;|,|y|) = 

• If / is defined by COMP from g and hi, . . . ,hk, then 

l/(^n)| = \9{hl{Xn), . . .,hk{Xn))\ 

< Pg{\hi{Xn)\, . . . ,\hk{Xn)\) 

< Pg{Phi{Xn), ■ ■ ■ ,Phk{Xn)) 
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(since polynomials in N[x\ are non-decreasing), so pf{\xn\) = Pg{phii\xn\), ■ ■ ■ ,Phk{Wn\))- 
• If / is defined by TRN form g,h,hi,hr, where we assume without loss of generality that 

- \g{x,z,y)\ < Pg{\x\, \z\, \y\), 

~ \he{z)\ < c\z\ and |/ir(-z)| < c\z\, 

- \h{x,z,y,ve,Vr)\ < ao + ai\x\ + a2\z\ +Y.{bi\yi\) + a{\ve\ + \vr\), 

then intuitively, at each level of the recursion, the length of the second argument is 
multiplied by c so that at the bottom level (after lg|x| steps), the second argument has 
length 0{c^^ ^^^\z\). At the same time, the lengths of each recursive call to / are multiplied 
by a, which means that the total length of / (bounded by the length of g in the base case) 
is multiplied by a^^NI. More precisely, we show that 

rig 1x11 -1 rigi^n-i 

Pf{\x\,\z\Ay\) = {(^o + ^Yiklyil)^ ■ (2a)^ + ai|a;| • ^ 

j=0 j=0 

\lg\x\]-l 

+a2\z\- Yl (2ac)J' + (2a)rigNl .p^(i,cngNl|^|jy'|) 

j=0 

< 2|x|rig«l+i [ao + Y^b,\y,\) + ai\x\ + asl^l |x| ^§^1 
+a-pg{l,c\z\\x\\'^'^\\y\) . 

Technically speaking, we need to use max{l,|2;|} everywhere that \z\ appears in this 
expression, but this does not change the proof substantially besides making it longer to 
write down. Also, we need to deal separately with special cases such as when a = or 
when the lengths of and hr are constants independent of \z\, but all of these cases 
simplify the proof so we present the general case only. 

Now, if X = e,0,l, then \f{x,z,y)\ = \g{x,z,y)\ < pg{\x\,\z\,\y\) < pg{l,\z\,\y\) < 
ao + ai|a;| + a2\z\ + ^ibi\yi\) + Pg{l, \z\, \y\) = p/(l, \z\, \y\). If |a;| > 1, then we consider 
two subcases. If |x| is even, then [lg(|a;|/2)] = [Ig |a;|] — 1, so 

\f{x,z,y)\ = \h{x,z,y,f{x<,he{z),y),f{>'X,hr{z),y))\ 

< ao + ai|a;| + a2\z\ + ^(6i|2/j|) 
+a{\f{xM,he{z),y)\ + \f{^x,hr{z),y)\) 

< ao + ai\x\ + a2\z\ + ^(6j|yi|) 
+a{pf{\x<\, \hi{z)\, \y\)+pf{\^x\, \hriz)\, \y\)) 

< ao + ai|x| + a2\z\ + ^(6i|yi|) + 2apf{\x\/2,c\z\, \y\) 
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< ao + ai\x\ + a2\z\ 

rigkn-2 I I rigi^n-2 

+2a [ao + J2ibi\yi\))- E i^^Y + ' ^ 

j=0 j=0 

[Ig 1x11-2 

+a2c\z\- Yl (2acy + (2a)r'gNl-i.p^(i^crigNl-ic|z|,|^|) 

<p/(kl> kl> 

If |x| is odd, then [lg((|a;| - l)/2)] < [Ig \x\] - 1 and rig((|x| + l)/2)] = [Ig - 1, 
\f{x,z,y)\ = \h{x,z,y, f{x'4,hi{z),y), f{>'X,hr{z),y))\ 
<ao + ai\x\ + a2\z\ + 

+a{\f{xM,he{z),y)\ + \f{P^x,hriz),y)\) 

< ao + ai|x| + 02!^! + y^^{bi\yi\) 
+a{pf{\x<\, \he{z)\, \y\) +Pf{\*'x\, \hr{z)\, \y\)) 

<ao + ai\x\ + a2\z\ + 

+a{pf{{\x\ - l)/2,c|z|, !y|) + l)/2,c\z\ 

< ao + ai|x| + a2\z\ + 



+a 



x — 1 



[igi,Tn-2 

+a2c|z|- Y (2ac)^' + (2a)rigl^n-i.p^(i,crigl^n-ic|z|,|^|) 

j=0 



+a 



rigkn-2 

E 



rig 1^11 -2 



j=0 



rigkn-2 



+a2c\z\- Yl (2ac)^ + (2a)^^sl^n-i.p^(i^crigNl-ic|^|,|^|) 

j=0 

< ao + ai|x| + a2|^;| + 

[igkii-i rigi^n-i 

j=i 3=1 

llg\x\-]-l 

+a2\z\- Y (2acy + (2a)rigNl .p^(i^JigNl|^|^|^|) 

rigia;n-i rigi^n-i 
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Now, we are ready to discuss circuits. For the sake of completeness, we summarize here 
Bloch's definitions and results, suitably modified to apply to our setting. We will be working 
with circuit families that compute /unctions instead of relations, i.e., circuits will generally have 
multiple output gates. In this setting, we define FNC^ to be the class of functions computed 
by constant depth circuit families (because the circuits have multiple output gates, this class 
contains interesting functions, unlike the relational counterpart NC^), and FNC^ to be the 
class of functions computed by logarithmic depth circuit families. 

We assume that the gate set for our circuits consists of constants and 1, unary identity 
(~) and negation (-■), and binary conjunction (A), disjunction (V), left projection (tt^), and 
right projection (tth) (this could be reduced at the cost of longer proofs). Given a circuit family 
composed of such gates, we identify gates in the circuits by pairs (out, a), where OUT is the 
number of an output gate of the circuit (in binary) and a € {L, R, S}* represents a path in the 
circuit from the given output gate, where L and R indicate the left and right inputs of a binary 
gate, respectively, and S indicates the only input of a unary gate. (Note that we number the 
output gates from right to left, starting with 1, and similarly for the input gates of each input.) 

Now, we want to work with uniform families of circuits. Unfortunately, the standard notion 
of [/£;• -uniformity defined by Ruzzo [28] does not make sense for constant depth circuits (it 
would require the extended connection language for the circuits to be recognizable by an ATM 
in constant time, which is not even enough time for the ATM to examine a gate number or 
an input length). To remedy this, Bloch defines a notion of mapping-uniform circuits, where 
the uniformity computation is divided in two phases. The main purpose of the uniformity 
computation is to be able to recognize connections in the circuit {i.e., given a gate and a path, 
what gate is at the end of the path?) and gate information (i.e., given a gate number, what 
type is that gate?). Because of our numbering scheme for gates, determining the descendant of 
a gate (out, a) along a path a' is easy: the answer is simply (out, aa'). Determining the type 
of an internal gate is also not difficult, as we will see. The hard part, requiring the "two-phase" 
approach, is to determine which input bit is tied to an input gate (which is necessary to fully 
specify the "type" of that input gate). 

Here is some intuition behind the two phases of the uniformity computation. Essentially, 
given an input gate (out, a) in a circuit, the first phase must use a to identify how to compute 
the number of the input bit as a function of the output gate number, but without carrying 
out that computation. This will be done by a deterministic Turing machine running in time 
proportional to |cj|, which is bounded by the depth of the circuit, and it is this phase of the 
computation that will be composed or iterated for functions defined by composition or recursion. 
The second phase of the computation will only be carried out once, by an ATM that combines 
the information from the first phase together with the rest of the information about the circuit. 
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More precisely, the first phase will output a term that may be thought of as mapping 
output bit positions to input bit positions that "affect" that output bit, i.e., given an input 
gate (out, (t) in the circuit tied to bit number r of some input parameter, we want to- (out) = r. 
For this purpose, we introduce the mapping language of a family of circuits, which consists of 
a set of functions that encapsulate all the "primitive" dependencies that may exist between 
output and input bit numbers. Besides the natural numbers used to represent bit positions, we 
also use the symbol "±" to indicate that a given output bit does not depend on any input bit. 
For a circuit with k input parameters yi, . . . ,yk, the mapping language contains the following 
function symbols (each function is implicitly defined to be equal to _L when its argument is _L). 

• undef(x) = ± • ADDj(x) = X + \yj\ • MlNj(a:;) = min{a;, \yj\} 

• one(x) = 1 • svBj{x) = x - \yj\ 

Before we move back to circuits, we argue that terms in the mapping language can be 
computed in alternating logarithmic time (as a function of m = max{|to-|, OUT, . . . , |j/jt|}). 
Given t^, out, . . . , an ATM can check in parallel if to- contains UNDEF and output ± 
immediately if this is the case; otherwise, the ATM guesses the position in where the last 
"one" appears, and replaces the subsequent part of the term with 1. Once these simple checks 
are done, the ATM can construct lists of numbers from . . . , and subterms of to for each 
block of functions of the form MlNj^ (. . . MlNj^(t') . . . ) in to-) and evaluate each of these blocks in 
parallel. The rest of the subterms will contain only ADDj and SUBj functions, and these lists of 
terms and numbers from |yi|, . . . , can be added and subtracted (using two's complement) 
with standard carry-save techniques. 

Now, following Bloch, we say that a circuit family is mapping-uniform if there exist a 
deterministic multi-tape Turing machine P and an ATM Q such that for every gate (oUT, a) in 
a circuit of the family, the following two conditions hold. 

1. If (out, cr) is an input gate tied to bit number r of some input parameter, then P on 
input a runs in time 0{\a\) and outputs a term to in the mapping language such that 
to(OUT) = r. 

2. Machine Q on input OUT, cr, |yi|, . . . , |yjfc|,r runs in logarithmic time, i.e., in alternating 
time 0(log(max{oUT, \a\, \yi\, . . . , \yk\})) and accepts iff (oUT, cr) is an internal gate of 
type r or (oUT, cr) is an input gate tied to bit number r of yj and r = {j, r) (in some 
standard encoding). 

A direct argument shows that any mapping-uniform family of circuits of at least logarithmic 
depth is also C/E*-uniform. Now, we are ready to show that Li C FNC^. 
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Lemma 2.4.4 For every f G BASE, f can be computed by a uniform circuit family in FNC^. 



Proof 

e: The empty circuit is the only one computing this function. Hence, on input a = e, P outputs 
undef(x); on input OUT, a, r, Q accepts iff OUT = 0, a = e, and t = e. 

0, 1: Circuits for these functions consist of a single output gate, one of the "constant" gates 
or 1, appropriately. Hence, on input a = e, P outputs undef(2;); on input OUT, cr,r, Q 
accepts iff OUT = 1, a = e, and r = or r = 1, respectively. 

►yi: Circuits for this function connect output gates to input gates directly, using unary identity 
gates Hence, on input a = S, P outputs simply x; on input OUT, cr, |yi|,r, Q accepts 
iff 1 < OUT < [|yi|/2], a = e and r = ?a, or cr = 5 and r = (1, OUT). 

yi < y2' Again, circuits for this function connect output gates to input gates directly, us- 
ing unary identity gates ~. Hence, on input a = S, P outputs add2(x); on input 
OUT, (T, |yi I, |y2|i ''"i Q accepts iff 1 < OUT < |yi| — 1^21, c = e and t = ^, or a = S and 
r = (l,r) for r = add2(out). 

Vi ' y2' The simplest circuits to compute this function would use unary identity gates connected 
directly to the input bits, as in the last two cases. Unfortunately, this would not allow P 
to know from a alone which term to output. Therefore, we do something slightly different, 
as depicted in Figure [2.4. 1[ 



O O 



o o o 



o 



yi 



y2 



Figure 2.4.1: Uniform circuits for the concatenation function 

Now, on input a = L, P outputs SUB2(2;), while on input a = R, P outputs simply x; on 
input OUT, cr, \yi\, \y2\,T, Q accepts iff 



1 < OUT < |i/2| and 

— a = £, T = TTfl, or 

— fj = L, T = 0, or 

— a = R, T = {2, out); or 



\y2\ + 1 < OUT < \y2\ + \yi\ 

— a = e, T = TTi^, or 

— a = R, T = 0, or 

— a = L, T = {l,r) 
for r = SUB2(out). 



and 
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yi ^ iy2ry3^y4:)' This is the only base function requiring circuits of depth greater than one. 
There is one consideration making the circuits sUghtly more complicated than it would 
seem necessary at first: the shorter of the last two input parameters must be "padded" to 
the same length as the longer, requiring some extra gates. So, the circuits for ? are of two 
different kinds: when \yi\ = 0, the circuits simply use unary identity gates ^ for output, 
connected directly to the input gates of 7/2, while if > 0, the circuits are depicted 
in Figure \2A.2\ (we illustrate the case when < {ysl < {y^l; the other cases are identical 
except for the obvious modifications to the types of the projection gates). 




Figure 2.4.2: Uniform circuits for the conditional function "?". 

Now, there are only a constant number of possibilities for a that machine P needs to 
check. We list each one and the corresponding output for P, as well as a brief explanation 
indicating which bit of which input parameter is designated by the given path a, in 
Table EXH 



a 


output 


explanation 


S 


X 


same bit of 2/2 


RL 


one(x) 


first bit of yi 


LIS 


ONE(a;) 


first bit of yi (negated) 


LRR 


MIN3(x) 


bit of ys (padded) 


RRR 


MIN4(x) 


bit of 2/4 (padded) 



Table 2.4.1: Behaviour of machine P for the conditional function. 

Next, on input OUT, o", \y2\, {ysl, |y4|,T, Q accepts iff 

• = 0, 1 < OUT < \y2\, and 

— cr = e, r = Ri, or 

- a = S, T = {2, out); or 
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• \yi\ > 0, 1 < OUT < max{|y3|, |y4|}, and 

— cr = £, r = V, or 

— cr = L, T = A, or 

— a = R, T = A, or 

— a = LL, T = -I, or 

— cr = LR, T = tTr if OUT < |y3|, r = tTi, if |y3| < OUT, or 

— cr = RL, T = (1, 1), or 

— cr = r = TTjj if OUT < |y4|, r = tTl if |y4| < OUT, or 

— a = LLS,T= (1,1), or 

— cr = r = 0, or 

— a = LRR, T = (3, r) for r = MIN3(out), or 

— a = RRL, T = 0, or 

— (7 = RRR, T = (4, r) for r = MIN4(out). 

^k{y\i • • • iVn)' Circuits will use unary identity gates !=a directly connected to the proper 
input bits. Hence, on input a = S, P outputs simply x; on input OUT, cr, |yi|, . . . , r, 
Q accepts iff 1 < out < \yk\, a = e and r = or cr = 5 and r = {k, OUT). □ 

Next, we want to show that functions defined by CRN also have uniform circuit families. 
For technical reasons {i.e., to simplify the proof), wc actually show the result for left CRN. 
(Since Lq and Li remain the same whether CRN or left CRN is used to define them, this is 
sufficient.) 

Lemma 2.4.5 If f{yi,y2, ■ ■ ■ , yn) is dcGncd from h hy left CRN on yi, where h has uniform 
circuits of depth d/j(|yi|, |y2|, • • • , |yn|)) then f has uniform circuits of depth 

max {dh{l, |y2|, ■ ■ ■ , \yn\),dh{2, |y2|, ■ ■ ■ , \yn\), ■ ■ ■,dh{\yi\, |y2|, • • • , |yn|)}- 

Proof A natural circuit for / consists of a series of /i-circuits in parallel, one for each output 
bit of /, where the i-th /i-circuit is connected to the first i bits of yi (as well as to every other 
input parameter). Clearly, the depth of this circuit is as stated above. Moreover, given a path 
cr, machine P simply simulates P/j to get a term and outputs ta{x) = t^(ONE(a;)), while 
machine Q accepts OUT, cr, |yi|, |j/2|, ■ ■ ■ , |yn|) iff = 0, OUT = 0, cr = e, r = e, or Q/^ accepts 

out' = l,(7,|yi| =OUT,|y2|,-..,|yn|,T. □ 

Following this, we need to show that the composition of functions computed by uniform 
families of circuits is also computable by uniform families of circuits, of the right depth. 
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Lemma 2.4.6 If /(yi, . . . , yn) is defined from g and hi,. . . ,hk by COMP, where g has uni- 
form circuits of depth dg{\zi\, . . . ,\zk\) and hi has uniform circuits of depth . . . , 
(for 1 <i < k), then f has uniform circuits of depth 

dg{\hi{yn)\, \hkiyn)\) + max{4.(|y„|)} + [lg(fc)l. 

Proof A natural circuit for / consists of a circuit for g whose input gates arc connected to 
the output gates of the corresponding circuits for hi directly. Unfortunately, machine P cannot 
tell what term to output just from a path in such a circuit, because P does not have enough 
time to determine which hi the path leads into (this would require determining the number of 
the input gate of g through which the path passes). Therefore, we construct a slightly more 
complicated circuit by adding a layer of selection gates of depth [lg(A;)] between the circuit for 
g and the hi circuits (somewhat like what was done for the concatenation function), in such 
a way that machine P can easily determine from a path a which hi the path goes through. 
Clearly, the depth of such a circuit is as stated above. Moreover, given a path a, machine P 
can break it up into ag through g (getting a term t^^), followed by a' through the selection 
subcircuit (which gives the index i of the function hi feeding into the g circuit), followed by 
a final part Uj through hi (getting a term to-J- P then outputs taiitagix)), in time linear in 
\a\. Also, on input OUT, a,\yi\, . . . , r, machine Q accepts iff (oUT, a) is a gate of type r 
in g (by simulating Qg), or (oUT, o") is a gate within [lg(A;)] steps of an input of g and r is 
the correct type of projection gate (computing tg-g (out) to figure out which input bit of g the 
path a goes through, and then checking the constantly many possibilities for the part of a 
through the selection subcircuit), or (oUT, a) is a gate of type r or an input gate r = {j, r) for 
r = tcr- (tcTg (out)) within an hi circuit (computing to-g(oUT) and tracking the path through the 
selection subcircuit to figure out the index i, and then simulating Q^. to verify r). All this can 
be done in logarithmic time. □ 

Finally, we show that functions defined by TRN can be computed by uniform families of 
circuits. 

Lemma 2.4.7 If f{yi,y2,y3, ■ ■ ■ ,yn) is deRned from g, h, h(, and hr by TRN, where g has 
uniform circuits of depth dg{\yi\, \y2\, {ysl, ■ ■ ■ , \yn\), h has uniform circuits of depth dh (a con- 
stant), and hi and hr have uniform circuits of depth d^ and dr (constants), then f has uniform 
circuits of depth 

0{dg{l, |y2||yir, lyal, • • • , \yn\) + log \yi\ ■ (dh + di + dr)) 
for some constant c G N. 



30 



Chapter 2. The String Algebra Li 



Proof A natural circuit for / consists of a binary tree of /i-subcircuits connected to the 
appropriate bits of the first input and to each other, with a layer of g circuits at the bottom, 
where the second input of the h and g circuits is connected to subtrees of hi and hj. circuits. 
As in the proof for composition, we use layers of selection gates to "glue" together the different 
subcircuits (between successive h circuits, between h and g circuits, between h oi g circuits and 
the circuits to compute left and right halves of the first input or left and right functions of the 
second input, as well as between successive "half" functions for the first and second inputs), so 
that P can tell from the path a alone which subcircuit a path leads to. The total depth of such 
a circuit is obviously as stated. Moreover, given a path cr, P can divide the path into portions 
through h circuits, accumulating the terms for each one, and the final portion of the path 
through an /i, g, hi, hr, or "half" circuit, outputting the composition of each term. This can 
obviously be done in linear time in the length of a. On input OUT, a, \yi\,\y2\,\y3\, ■ ■ ■ , \yn\,T, 
machine Q can break up the path a into a first part through some number of h circuits and a 
final part a' entirely contained inside some h, g, hi, hr, or "half" subcircuit. Q can compute the 
term corresponding to the first part of the path to figure out which output bit of the subcircuit 
a passes through, and then simulate the ATM for that subcircuit to verify r, all in logarithmic 
time. □ 

Now, we can put all of these results together. 

Theorem 2.4.1 For all f Lq, f can be computed by a uniform family of circuits in FNC^ 
(i.e., of constant depth). 

Proof By induction on the definition of /: Lemma 12.4.41 shows that functions in BASE 
have uniform constant depth circuits, and Lemmas 12.4.51 and 12.4.61 show that CRN and COMP 
preserve uniform constant depth. □ 

Theorem 2.4.2 For all f ^ Li, f can be computed by a uniform family of circuits in FNC^ 
(i.e., of logarithmic depth). 

Proof By induction on the definition of /. 

• If / E Lq, then the preceding theorem shows the result. 

• If / is defined by CRN from /i E Li, then Lemma 12.4.51 shows that / can be computed by 
uniform circuits of the same depth as that of the circuits for h, which shows the result. 



• If / is defined by COMP from g,hi, . . . ,hk E Li, then Lemma [2. 4. 61 and Lemma [2.4.3l (on 
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the length of functions in Li) show that / can be computed by uniform circuits of depth 

dg{\hM\. • • • , \hm\) + max{4,(|y|)} + \\g[k)^ 

= o(\og{m&yi{ph^{\y\), . . . + log(max{|y|})) 

= 0(log(max{|y|})). 

If / is defined by TRN from g ^ Li and h, h£,hr S Lq, then Lemma |2.4.7^ together with 
Lemmas 12.4.21 and 12.4.31 on the length of functions in Lq and Li, shows that / can be 
computed by uniform circuits of depth 

0{dg{l, \y2\\yi\'', Ivsl, • • • , \yn\) +log|yi| • (4 + de + dr)) 

= 0(log(max{|y2|,- • • , \yn\}) +log|yi|) 

= 0(log(max{|yi|, [2/2!, • • • , \yn\}))- □ 



Chapter 3 

The Quantifier- Free Theory Ti 



In this chapter, we will define the theory Ti and give its formal development, including many 
proofs of simple properties of functions of Ti, as well as many derived rules, and concluding 
with an illustrative example by proving the pigeonhole principle. 

3.1 Definitions 

The theory Ti that we now describe is a quantifier- free system, i.e., a free- variable theory with 
propositional connectives, modeled after Cook's PV [18] but based on the algebra Li. The 
language of Ti consists of the function symbols 

{e,0,l,o,i,'^,^,-, >,<,?}, 

the function constructors {A, ^CRN, rCRN, TRN}, the predicate symbol {=}, and the usual 
propositional connectives {^,A,V,— More precisely, we have the following definitions 
(where we use the informal notation xO for (x • 0) and Ox for (0 • x) — similarly for xl and Ix). 

Definition 3.1.1 The function symbols and terms of Ti are defined as follows. (The in- 
tended interpretation of each function symbol is as given in Chapter [21 where ^CRN represents 
"left" (or "reverse") CRN and rCRN represents "right" (or "plain") CRN, and we use the no- 
tation introduced there instead of the more formal prefix notation. Also, each function symbol 
and each term has a rank of either or 1 — that intuitively indicates which one of Lq or Li the 
function symbol or term belongs to.) 

1. Each variable xq, xi, X2, ... is a term of rank 0. 

2. If / is an n-place function symbol and ti, . . . ,tn are terms, then f{ti, . . . ,tn) is a term 
whose rank is the maximum of the ranks of f,ti, . . . ,tn {i.e., the rank of f{ti, . . . , tn) is 
iff the rank of each one of /, , . . . , is 0) . 
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3. £, 0, 1 are 0-place function symbols (constants) of rank 0. 

4. 0, 1, -^j ► are 1-place function symbols of rank 0. 

5. •, >, < are 2-place function symbols of rank 0. 

6. ? is a 3-place function symbol of rank 0. 

7. If t is a term and list of variables including all the variables in t, then 
[Xxi . . .Xn-t] is an n-place function symbol of the same rank as that of t. 

8. If h is an (n + l)-placc function symbol, then ^CRN[/i] and rCRN[/i] are (n + l)-place 
function symbols whose rank is that of h. 

9. If g is an (n+2)-place function symbol, h is an (n+4)-place function symbol of rank 0, and 
he and hr are 1-place function symbols of rank 0, then TRN[5, h, he, hr] is an (n + 2)-place 
function symbol of rank 1. 

Definition 3.1.2 The axioms of Ti arc as follows (except for the propositional and equality 
axioms, they simply define the function symbols). 

0. Any standard, complete set of axioms for the propositional calculus (with equations of 
the form x = y in place of propositional atoms, for arbitrary variables x and y). 

1. (a) X = x 

(b) x = y y = x 

(c) {x = y f\y = z) — > X = z 

(d) (xi = yi A • • • A Xfe = yfc) f{xi, ...,Xk) = f{yi, ...,yk) 
(for all fc-ary function symbols /, for all A; > 1) 

2. e^OAO^^lAl/e 

3. (a) X ■ s = X A X ■ yO = {x ■ y) ■ A x ■ yl = {x ■ y) ■ 1 

(b) X ■ y = £ {x = e Ay = e) 

(c) X ■ y = (x = eA2/ = 0)V(x = 0Ai/ = e) 
X ■ y = 1 {x = eAy = l)\/{x = lAy = e) 

4. (a) e> X = X A Oy>x = 0>{y>x) A ly>x = l>{y>x) 

(b) 0>£ = £ A 0\>0x = X A 0>lx = X 
1>£ = £ A 1\> Ox = X A l>lx = a; 

(c) y > X = e X \> yO e x \> yl s 
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5. (a) X = X <\e A (x < y) <l = x < yO A {x < y) <l = x > yl 

(b) e = £<\Q A 2; = xO<lO A x = xl<lO 
e = e <l A 2; = xO<ll A x = xl <1 

(c) e ^ \y <\x e ^ Oy <\ X e = x <\y 

6. (a) = e /\ q{x^) = qx-0 A q{x\) = qx ■ Q 
(b) le = e A i(xO) = ix-l A i(xl) = ix-l 

7. el (x, y, z) = X A ? (x, y, z) = q{z <\ y) ■ y A wl? {x, y, z) = o{y <\ z) ■ z 

8. (a) (x-^) • (►x) = X 

(b) (x-^ < ►x) = e A 1 > (x-^ > ►x) = e 

9. We use "x (y, z)" as a shorthand notation for (x-^ > ►x) ? {y, z, z) (which equals y if 
the length of x is even, z if the length of x is odd). 

(a) (xO)-^ = X (x-^, x-^ • ((►x • 0) < ►x)) A (xl)-^ = x (x-4, x-^ • ((►x • 1) < ►x)) 

(b) ►(xO) = X (►x • 0, 1 [> (►x • 0)) A ►(xl) = X T'^^ (►x • 1, 1 > (►x • 1)) 

(c) (Ox)^ = X ((0 • x^) < 1, • x^) A {lx)< = X ((1 • x-^) < 1, 1 • x<) 

(d) ►(Ox) = X ((x-^ > (0 • x-^)) • ►x, ►x) A ►(Ix) = X ((x-^ > (1 • x-^)) • ►x, ►x) 

10. [Axi . . .x„.t](xi,. . . ,x„) = t 

11. (a) ^CRN[/i](e,y) =e 

A ^CRN[/i](Ox,y) = ((/i(Ox,y) -0) </i(Ox,y)) • ^CRN[/i](x, y) 
A £CRN[/i](lx,y) = ((/i(lx,y) -0) </i(lx,y)) • ^CRN[/i](x, y) 

(b) rCRN[/i](e,y) =e 

A rCRN[/i](xO,y) = rCRN[/i](x, y) • (/i(xO,y) > (0 • /i(xO,y))) 
A rCRN[/i](xl,y) = rCRN[/i](x, y) • (/i(xl,y) > (0 • h(x\,y))) 

12. TRN[y,/i,/i^,/i,](x,z,y) = x<l?(5(x,z,y),t,t) 

where t = h{x, z, y, TRN[5t, h, he, hr]{x<, he{z),y),TRN[g, h, he, hr]{>-x, hr{z), y)) 

Remark 3.1.1 By Claim 12.4. H every function in FALOGTIME is represented by some 
function symbol in Ti, and every function symbol in Ti represents a function in FALOGTIME. 

Definition 3.1.3 The rules of inference of Ti are as follows. 

0. Any standard, complete set of rules for the propositional calculus. 
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1. Substitution for an arbitrary formula A, variable x, and term t: 



A h A[t/x] 



2. Induction on Notation (NIND) for an arbitrary formula A and variable x: 

(a) ("left" version) A[e], A[x] ^ A[Ox], A[x] ^ A[lx] h A 

(b) ("right" version) ^ ^[xO], ^ h A 

3. Tree Induction (TIND) for an arbitrary formula A, variables x, z, and unary function 
symbols he, hr of rank 0: 



3.2 Developing the theory 

In this section, we give a formal development of Ti, starting with a few simple theorems and 
working our way towards multi-variable versions of CRN and NIND. These will be used to 
define binary addition and "counting" functions, and to prove their properties. 

3.2.1 Basic definitions and theorems 
Claim 3.2.1 e ■ x = x 

Proof By NIND on x: e ■ e = e (by Axiom [3al and Ruled]), 



A[e,z],A[0,z],A[l,z],{A[x'*,he{z)] AA[>-x,hr(z)]) ^ A[x,z] h A 



e ■ xO = {e ■ x) ■ 



(Axiom [3al Rule [I]) 



= X • 



(Induction Hypothesis, Axiom \Td\ Ruled]) 



e ■ xl = {e ■ x) ■ 1 



= X ■ 1 



(Axiom Eil Ruled]) 

(Induction Hypothesis, Axiom [Td] Ruled])- □ 



Claim 3.2.2 x ■ {y ■ z) = {x ■ y) ■ z 



Proof By NIND on z: x ■ {y ■ e) = x ■ y 



{x ■ y) ■ e (by Axiom [3a| and Rule d]) 



X ■ {y ■ zO) = X ■ {{y ■ z) ■ ti) 



(Axioms Eaj and dd] Rule d]) 



= (x • (y • z)) • 
= ((x • y) • z) • 
= (x • 2/) • zO 



(Axiom [3a| and Rule d]) 
(Induction Hypothesis) 
(Axiom [3a| and Rule d]) > 



and a similar proof shows x ■ {y ■ zl) = {x ■ y) ■ zl. □ 
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Note that in the proofs that follow, we will not mention explicitly the application of partic- 
ular axioms, of the induction hypothesis, or of the substitution rule when they are self-evident. 
Also, when proving a statement by NIND, the cases for xO and xl will often be almost identical 
(as above) so we will prove both cases at once using "z" to stand for either or 1. 

Remark 3.2.1 Be advised that the rest of this section contains a large number of technical 
claims, together with their proofs, which are included here for the sake of completeness. Most 
of these claims are of limited interest in themselves, apart from illustrating the style of proofs 
in Ti and giving basic properties of functions which will be used in later proofs. For this reason, 
we recommend that on a first reading, the reader focus mainly on the Definitions, Theorems, 
and Derived Rules, which contain the essential results. 

On ">" and "<" 

We start by defining two functions that will serve as a convenient shorthand notation throughout 
the rest of this chapter, and prove basic properties of these functions. 

Definition 3.2.1 (L) > = [Xx.l\>x] (R) < = [Xx.x<il] 

(To make the notation more consistent with previous usage, we will write ">x" and "a;<" instead 
of the more formal ">(x)" and "<(a;)", respectively.) 

Claim 3.2.3 (L) 0\>x = l>x (R) x<il = x<iO 

Proof (L): Immediate from Axiom [4bl (R): Immediate from Axiom [5bl □ 

Claim 3.2.4 (L) >e = e A>{Ox) = x A>{lx) = x (R) e< = e A {xO)< = x A {xl)< = x 

Proof (L): This is just a restatement of Axiom I4b[ (R): This is just a restatement of 
Axiom I5b[ □ 



Claim 3.2.5 (L) y>e = e (R) e = e<\y 

Proof (L) By NIND on y: £\>e = e,iy>£ = i>{y>e)=i>s = e. (R) By NIND on y: 

e[>£ = £,£<\yi = {e<\y)<\i = £<\i = £. □ 

Many of the theorems that follow will be similar to the ones above in having a "left" and 
"right" version, both of which can be proved in the same way (by using the appropriate version 
of NIND when necessary). Hence, to avoid unnecessary repetition, we will only give the proof 
of one version from now on. 
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Claim 3.2.6 (L) >{y \> x) = y > >x (R) {x <\ y)< = x< <\ y 

Proof (L) By NIND on y: >(e [> x) = >x = e > >x, >{iy \> x) = »{y \> x) = >{y > >x) = 
iy > >x. □ 

Claim 3.2.7 (L) zy \> zx = y > x (R) xz<yz = x<y 

Proof (L) By NIND on z: ey\> ex = y> x, {iz)y > {iz)x = >{zy [> {iz)x) = zy > >{iz)x = 
zy \> zx = y \> X. □ 

Corollary 3.2.8 (L) x>xy = y (R) y = yx<x 

Corollary 3.2.9 (L) x\>x = e (R) e = x < x 

Now, we are ready to define two more functions tliat will also be used as a convenient 
shorthand notation for the rest of the chapter. 

Definition 3.2.2 (L) ^ = [Xx.x<\>x] (R) ' = [Xx.x<\>x] 

(To make the notation more consistent with previous usage, we will write "V and "x"' instead 
of the more formal "^(x)" and '"(x)", respectively.) 

Claim 3.2.10 (L) 'e = e A\Ox) = A\lx) = I (R) e' = e A (xO)' = A (xl)' = 1 

Proof (L) From the definition, by Claim[32]l]and by Corollary l3.2.8l V = £<!>£ = £<\e = e, 
^{ix) = ix <\ >{ix) = ix <\ x = i. □ 

Claim 3.2.11 (L) V • >x = x (R) x = x< • x' 

Proof (L) By NIND on x: ''e ■ >e = e ■ e = e, \ix) ■ >{ix) = i ■ x = ix. □ 
On "•" 

Claim 3.2.12 (L) e / ix (R) xi / e 

Proof (L) By Axioms [2] and I3bl and by taking the contrapositive: xi = s^x = eAi = 

e = e. □ 

Claim 3.2.13 (L) Ox / ly (R) xO / yl 

Proof (L) By Axiom [21 Claim I3.2.10[ Axiom \ld\ and by taking the contrapositive: Ox = 
ly^\Ox) =\ly)^0 = l. □ 
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Claim 3.2.14 (L) >x = e^x = e\/x = 0\/x = l (R) x = e\/x = 0\/x = l^e = x< 
Proof (L) By NIND on x: >e = e s = e, >{ix) = e x = e ix = ie ^ ix = i. □ 
Theorem 3.2.1 (L) x = eVx = 0->xVx = l->x (R) x = ey x = x<-Q\/ x = x<-l 
Proof (L) By NIND on x: e = e, ix = i ■ >{ix). □ 

Note that by Claims 13.2.121 and 13.2.131 we can show that exactly one of the disjuncts holds 
(i.e., that x = e^x^{)->xf\x^ l->x and x = d->x^x^ef\x^l->x and 
x = l->x^x^Q->xf\x^e). 

Corollary 3.2.15 (Lj x / e^x = 0->xVx = l->x (Rj x / e^x = x<-OVx = x<- 1 

Note that Theorem 13 . 2 . 1 1 can easily be generaUzed to show, for example, x = eVx = OVx = 
1 V X = 00 • >>x V X = 01 • »x V X = 10 • >>x V X = 11 • »x, or, by substituting various terms 
for X, xi = z V xi = • {>x)i V xi = 1 • (>x)i, etc. Because {Ay B) h {A^ C) ^ {B ^ C) ^ C is a. 
theorem of Ti , we can use Theorem 13.2.11 together with substitution to define an entire family 
of "derived rules" in Ti, like the following. 

Derived Rule 3.2.1 

1. (L) A[e], A[{)x\, A[lx] h A (R) A[e\, A[x% A[xl] h A 

2. A[e],^[0], A[l],yl[OxO], A[0xl], A[lxO], A[lxl] h A 

As an example of application of Derived Rule 13.2. H we prove the following simple claim. 

Claim 3.2.16 (>x)< = >(x<) 

Proof By Derived Rule 13.2.11 (>e)< = e = >(e<), (>i)< = £< = >£ = >(i<), {>{ixj))< = 
(xj)< = X = >{ix) = >{{ixj)<). □ 

On "o" and "i" 

Claim 3.2.17 o{x ■ y) = qx ■ oy 

Proof By NIND on y: Q^x-e) = qx = ox-e = qx-qE, o{x-yi) = o{x-y)-0 = ox-oy-0 = ox-o(yi). 
□ 

Note that an identical theorem can be proved with ix in place of qx. In what follows, we will 
often need to prove theorems in which "q" or "i" appear, where the particular function used 
does not matter. We will indicate this by using "j" to stand for either of the above functions. 



40 Chapter 3. The Quantifier-Free Theory Ti 

Claim 3.2.18 jx ■ j = j ■ jx 

Proof By NIND on x: je-j = e- j= j= j- e = j- je, j{xi) ■ j = (jx ■ j) ■ j = (j • jx) ■ j = 
j ■ {jX ■ j) = 3 ■ j{xi). □ 

Claim 3.2.19 jX ■ jy = jy ■ jX 

Proof By NIND on y, and by Claims [3. 2. 171 and 13. 2. 181 

jx-j (yi) = jX ■ jy ■ j = jy ■ jx ■j=j (yx) ■j=j-j (yx) = j ■ jy ■ jx = jy ■ j ■ jX = j (yi) ■ jX. □ 

Corollary 3.2.20 j{xy) = j{yx) 

Claim 3.2.21 jX = j{Qx) and jx = j{ix) 

Proof (We will prove only the first property, the second one being almost identical.) By 
NIND on x: j£ = j(oe), j{xi) = jx ■ j = j(ox) • j = j(ox • 0) = jio{xi)). □ 

Corollary 3.2.22 qx = oy ^ ix = ly 

Claim 3.2.23 >jx = jx< 

Proof By NIND on x, and by Claim [3?2. 181 >je = >e = e = e< = je<, >j{xi) = >{jX ■ j) = 
HJ ■ jx) = jX = ijx ■ j)< = j{xi)<. □ 

Claim 3.2.24 (L) >{jx) = j{>x) (R) j{x<) = {jx)< 

Proof (L) By NIND on x, and by Claim [32T71 >{je) = >e = e = je = j{>e), Hjiix)) = 
Hj ■ jx) = jX = j{>{ix)). □ 

Claim 3.2.25 x = e-<-^jX = £ 

Proof By NIND on x: e = e ^ je = e, xi = e ^ jx ■ j = e. □ 
Claim 3.2.26 (L) x\>y = jX[>y (R) y < x = y <i jx 

Proof (L) By NIND on x: e\>y = y = je>y,ix\>y = >{x >y) = >{jX > y) = j ■ jX\> y = 
j{ix) >y. □ 

Claim 3.2.27 (L) x \> jy = j{x \> y) (R) jy <i x = j{y <\ x) 

Proof (L) By NIND on x, and by Claim [3X2il e\>jy = jy = j{e\>y), ix[>jy = >{x> jy) = 
■>j{xt>y)= j{>{x>y))= j{ix>y). □ 
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Claim 3.2.28 j{x > y) = j{y <i x) 

Proof By NIND on x, and by Claims and [3X23} jie\>y) = jy = j{y<ie), j{ix>y) = 
j{>{x > y)) = >{j{x > y)) = >{j{y < x)) = {j{y < x))< = j{{y <\ x)<) = j{y < ix). □ 

Corollary 3.2.29 x \> jy = jy < x 

Claim 3.2.30 (L) ^x = x<lx< (R) >x \> x = x' 

Proof (L) By NIND on x, and by Corollary [3X8l and Claims [3X26] and [3222 'e = e = 

e <\ e<, ^(ix) = i = ix <} X = ix <\ qx = ix <} >(Oo2;) = ix <\ >o{ix) = ix <\ Q{ix)< = ix <] {ix)<. 
□ 

Corollary 3.2.31 (L) \xi) = xi<\x (R) x \> ix = (ix)' 
Claim 3.2.32 

1. (L) y\>xi^e^y\>xi = y\>x-i (R) e ^ ix <y ^ i ■ x <\y = ix <\y 

2. (L) y\>xj^e^y>xi^e (R) e^x<iy^e^ix<iy 

3. (L) o{y ■ (y \> x)) = o{x ■ {x \> y)) (R) o{{x <\y)-y)= o((y < x) ■ x) 

Proof 

1. (L) The first direction is proved by Claim 13.2.121 y[>xi = y\>x-i^yt>xi^£. The 
other direction is proved by NIND on y: £\>xi^£^£[>xi = £\>x-i, jy[>xi^ 
£ >(y > xi) ^e^y\>xiy^£^y\>xi = y\>x-i^ >(y > xi) = >(y \> x ■ i) = 
y\>xl^^ (e, >(y \> x) ■ i) ^ jy \> xi = jy \> xi (e, jy \> x ■ i) = jy \> x ■ i. 

2. (L) By NIND on y and the preceding claim: e>x = x^£^£\>xi = xi^e, jy>x = 
>{y\>x) £^yt>x 7^ e^y\>xi ^ £^y\>xi = y\>x-i^>{y\>xi) = >{y\>x)-i = jy\>x-i / e. 

3. (L) The claim is proved first under the assumption that x [> y = e (which implies by 
Axioms [Icl and [5cl and by the preceding claims, that y t> xi = y t> x ■ i, and also implies 
that xi\>y = >{x\>y) = e), and then under the assumption that x\>y ^ e (which implies 
by Axioms [Icj and [5cl and by the preceding claims, that y\>xi = y\>x = e). Then, a 
simple application of modus ponens with the tautology x\>y = £\lx\>yi^E yields the 
claim. 

By NIND on x, and under the assumption that xx>y = e: o(y • (y Oe)) = oV = oi^'i^^u)), 
o(y • (y > xi)) = o(y ■{y\>x)-i) = o(y ■ {y \> x)) ■ = o{x ■ {x \> y)) ■ = ox ■ = o{xi ■ e) = 
o{xi ■ {xi > y)). 
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By NIND on x, and under the assumption that x>y ^ e: o{y ■ {y \> e)) = on = o(£" (£l>2/))> 
o{y ■ {y \> xi)) = o{y ■ e) = o(y • {y t> x)) = q{x ■ {x \> y)) = qx ■ q{x> y) = qx -Q ■ >o{x t>y) = 
o{xi) ■ o{xi> y) = o{xi ■ {xi\> y)). □ 

On "?" and related functions 

Now, we will prove a group of theorems about the conditional function "?". Note that in the 
statement of some of the theorems below, we will need to express the fact that terms t and u 
have the same length, something which can be done by the equation jt = ju. 

First, we introduce two new functions defined in terms of "?" that will be used through- 
out the rest of this chapter for notational convenience. Whereas the conditional function "?" 
performs a three-way test on its first argument, the "zero-length conditional" function 
tests whether the length of its first argument is zero or not, and the "even-length conditional" 
function "?^^" tests whether the length of its first argument is even or odd (?^^ has already 
been introduced informally in Axiom [9]). 

Definition 3.2.3 = [Xxyz.x ? (y, z, z)] 

Definition 3.2.4 = [Xxyz. {xM \> ►x) {y, z)] 

(To make the notation more consistent with previous usage, we will write "x (y, z)" and 
"x?^^(?/,z)" instead of the more formal "?^^(x, y, z)" and "?^^(x, y, z)" , respectively.) 

Claim 3.2.33 w? {x,y,z) = w? {x,o{z < y) • y, o(y <i z) ■ z) 

Proof Immediate from Axiom [71 □ 

Claim 3.2.34 w ? {x,y, z) = w' 7 {x,y, z) 

Proof By NIND on w: e ? {x,y,z) = x = e' ? {x,y,z), wO ? {x,y,z) = o{z <\ y) ■ y = 
? (x, y, z) = (wO)' ? (x, y, z),wll (x, y, z) = o(y < • ^ = 1 ? (x, y, z) = {wl)' ? (x, y, z). □ 

Claim 3.2.35 wi (x, y) = y 

Proof By Corollarv l3.2.8t wi (x, y) = wi? {x,y,y) = o{y <\ y) ■ y = qE ■ y = y. □ 
Corollary 3.2.36 iw (x,y) = y 
Corollary 3.2.37 w (x, y) = jw (x, y) 
Corollary 3.2.38 w (y, y) = w1 (y, y,y) =y 
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Theorem 3.2.2 For any k-ary function symbol f, 

{oVi =o^iA---Aoyfc = oZk)^wl{f{xk),f{yk)J{zk)) = f {w1 {xi,yi, zi), . . . ,w7 (xn,yn, Zn)) . 

Proof By NIND on w, and under the assumption that gyi = ozi A • • • A o^fc = ozt- £ ? 
{f{xk),f{yk),f{zk)) = f{xk) = f{£'^{xi,yi,zi), . . . ,e?{xk,yk,Zk)), w07 {f{xk), f{yk), f{zk)) = 
fim) = f{wO ? {xi,yi,zi), ...,w0 7 {xk,yk,Zk)), wl ? {f{xk)J{yk)J{zk)) = f{zk) = f{wl ? 
(xi,yi,zi), . . . j-wl ? □ 

Claim 3.2.39 

U; ? (w ? (Xi,^!,^!),^ ? {x2,y2,Z2),wl (X3,y3,2:3)) =wl (xi, 0(2:2 < y2) ' 2/2,o(y3 < ^s) ' Z3) 

Proof By NIND on w and by Claim l3.2.32l e7 [e? {xi,yi, zi), e7 {x2,y2, Z2),e? {xs,y3, zs)) = 
e?{xi,X2,X3) = xi= e7{xi,o{z2<\y2)-y2,o{y3<iZ3)-Z3), 'w07{w07{xi,yi,zi),w0?{x2,y2,Z2),w07 
{x3,y3,z3)) =wO? (0(2:1 < yi) ■ 2/1,0(22 < ^2) • ^2, 0(2:3 < ys) • ys) = o((o(23 < ^3) • ^3) < (0(2:2 < 

y2) -^2)) -0(2:2 <y2) -^2 = o((o(y3 <2;3) • Z3) < (o(z2 <?/2) ' ^2)) • 0(2:2 < 2/2) ■y2 = w0? (xi, 0(2:2 < 

^2) • y2, o(y3 < 23) • 2:3) , and similarly for wl. □ 

Claim 3.2.40 w ? (xo, yo, zq) ? (xi, yi, 21) = 

w7 {xq? (xi,yi,zi), (0(2:0 < yo) • yo) ? (a;i, yi, 2:1), (o(yo < 2:9) • 2:0) ? (xi,yi,zi)) 

Proof A straightforward NIND on w, very similar to the proof of Claim [5^.2.391 □ 

Corollary 3.2.41 u;?^^ (xo,yo) (a;i,yi) =w?^'' (xo T""^ (xi,yi),yo T""^ (xi,yi)) 

Claim 3.2.42 x (y (zq, u7o), y (21,^^1)) = y (x (29,^1), x?^^ (wo,tyi)) 

Proof By NIND on x: e (y (zo, li'o), y (zi,ii'i)) = y (20,^^0) = y (e ^""^ 
(Z9,zi),e?^^(t/;9,^x;i)), xi?^^(y?^^(z9,u;o),y?^^(2i,u^i)) = y?^^(2i,u^i) = y?^^(xi?^^(29,^i),xi?^^ 
(wo,i«i))- □ 

Claim 3.2.43 w = w ? {e,w< ■ 0,w< ■ 1) 

Proof By NIND on u>: e = s7 [s, e< • 0, e< • 1), u-O = wO ? (e, t(;0, wl) = wQl {e, (u;0)< • 
0, (u'0)< • 1), u-l = ? (e, -wO, wl) =wll (e, ('u;l)< • 0, {wl)< ■ 1). □ 

Corollary 3.2.44 w = wl^^ {e, w) 

Theorem 3.2.3 w7 {x,yo,yi) = z^{w = sAx = z)\/{w = w<-OAo{yi<iyo)-yo = z)\/ {w = 
w<-lA o{yo < yi) ■yi = z) 
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Proof By NIND on w: el {x,yo,yi) = z-^x = z-i-^e = eAx = z, wO ? {x,yo,yi) = 

z'^o{yi<iyo)-yo = z^wO = (u'0)<-OAo(yi <yo) -yo = z, wi?(x,yo,yi) = ^^o(yo<yi)-yi = 

z ^ wl = {wl)< ■ I Ao{yo <]yi) ■ yi = z. □ 

Corollary 3.2.45 w {x,y) = z {w = e A x = z) V {w ^ e A y = x) 
Theorem 3.2.4 For any term u, w (n[e/?i;], n) = u. 

Proof By induction on the structure of u: iiu = w, then wf^^ {u[e/w],u) = w?^^ {e,w) = w 
by Coroharv I3.2.44t if u = x ^ w, then w {u[e/w], u) = w (x, x) = x by Corollary 13.2.381 
if-u = f{h,...,tn), then wl'"^ {u[e/w],u) = w {f{ti[e/w], . . . ,tn[e/w]), f{h, . . . ,tn)) = 
/(u;?^^ {ti[e/w],ti),...,w7^'^ {tr,[e/w],trr)) = f{ti,...,tn) = uhy Theorem [3X2] and the 
induction hypothesis. □ 

Claim 3.2.46 (L) \zx) = z Cx,'z) (R) (xz)' = z {x',z') 

Proof (L) By NIND on z, and by Corollary [3X36} \ex) ='x = e Qx, V), \{iz)x) = i = 
iz?^^ Cx,\iz)). □ 

Claim 3.2.47 (L) >{zx) = z 7^^ {>x,>z ■ x) (R) {xz)< = z {x<,x ■ z<) 

Proof (L) By NIND on z, and by Corollary I3.2.36t >{ex) = >x = e {>x,>e ■ x), 
>{(iz)x) = zx = iz (>x, >{iz) ■ x). □ 

Claim 3.2.48 vl^"- {u,t) = {v ^ e = u) 

Proof By NIND on v: el^^ {u,t) = u ^ u = u {e e ^ u = u), viV^ {u,t) = u t = 
u ^ {vi ^ e ^ t = u). □ 

On and "►" 

Claim 3.2.49 (L) j{x^) = {jx)< (R) ^(►x) = ►Qx) 

Proof (L) By NIND on x, Axioms [9l and various theorems proved above: j(e-^) = £ = 

j{{xi)<) = j{{x< > ►x) (x-^,x-^ • \>-x ■ i))) 

= j{x< > ►x) {j{x<),j{x< ■ '(►x • i))) 
= {jix<)>j{P^x)) {{jX)<,j{x<) ■ V^x • i)) 
= {{,x)M > ►G-x)) 7'^ (G-x)^, {jx)< ■ \>{,x) ■ j)) 
= ijx ■ j)^ = ij{xi))< □ 
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Basic properties of "?^^" can easily be obtained from the basic properties of "?^^" , on which 
it is based. In order to prove properties particular to "?^^", we will need the following lemmas. 
But first, a few reminders. 

• j{>{>'X • i)) = >(^jX • j) = >{j ■ ►jx) = ^(►x) 

• {z ■ i) \> y = j{z ■ i) > y = {j ■ jz) > y = >{jZ > y) = >{z > y) 

Lemma 3.2.50 o^;-^ > (►o^^ • 0) = {ox< t> ►qx) • 

Proof By Axioms Hcl [5c| and[8bl and by Claim 13.2.32] qx< <\ ►qx = e^ox< \> (►qx-O) ^ 
e qx< > (►ox • 0) = (ox-^ > ►ox) • 0. □ 

Lemma 3.2.51 o{xi)< t> ^o{xi) = {qx< > ►qx) (0, e) 

Proof By Lemma 13.2.501 

o(xz)-^ > ►o(xi) = X (ox-^ > (►ox • 0), (ox-^ • X^ox • 0)) > >{>'ox ■ 0)) 
= X {{ox< > >-ox) ■ 0, {ox< ■ 0) > ►ox) 
= (x-^ [> ►x) {o{x< > ►x) • 0, >o{x< > ►x)) 

= (ox^>^ox)?^^(0,e) □ 

Theorem 3.2.5 xi ?™ (y, z) = x (z, y) 

P ROOF By Lemmas 13.2.501 and 13.2.511 

xi ?™ (y, z) = o{xi) l""- {y, z) 

= {o{xi)< > ►o(xi)) (y,z) 

= (ox^>^o^) ?^Mo,e) 

= (ox^>^o^) ?^^(0 r^{y,z),Er^{y,z)) 

= (ox^ > ►ox) (z,y) 

= ox?^M^,y) = 2;?^M^,y) □ 



Claim 3.2.52 (L) (ixj)-^ = i ■ x-^ (R) ►(ixj) = ^x-j 
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Proof (L) By Axioms M and Theorem 13.2.51 



{ixj)< = xj {{i{xj)'*)<,i{xj)<) 




= i ■ xM □ 

Claim 3.2.53 (L) {>x<)< = >{x<) (R) ►(>x<) = (►x)< 

Proof (L) By Derived Rule [Ml] and Claim [3232J {>e<)M = e = >{e<), {>i<)< = e = 
>{i<), {>{ixj)<)< = x< = >{i ■ x<) = >{{ixj)<). □ 

3.2.2 Further definitions and theorems 

In this section, we define many functions in Ti and prove their basic properties. We also give 
(and prove) a number of useful derived rules for Ti . 

From now on, we will not give proofs that consist only in a straightforward application 
of NIND. Proof sketches will be given for more complex theorems, and complete proofs are 
provided in Appendix [A] for most of the theorems below. 

On generalizations of NIND 

First, we define some generalizations of NIND based on Derived Rule l3.2TTl 
Derived Rule 3.2.2 



Proof (We will prove only (R), the case for (L) being almost identical.) Let us define 
a formula EL[x] : o{x< \> ►x) = e. By Lemma I3.2.5H we immediately get that EL[xi] <-> 
^EL[x] <-> EL[ix]. To prove that A is true under the given hypotheses, we will show that the 
hypotheses imply the following two statements: 



(L) A[e], A[0], A[l], A[x] ^ A[00x] A A[Olx] A A[Wx] A A[llx] h A 
(R) A[e],A[0],A[l],A[x]^A[xOO]AA[x01]AA[xlO]AA[xll] \- A 



{EL[x] ^A[x])A {^EL[x] A[x<\), 
{^EL[x] ^A[x])A {EL[x] A[x<]). 



(3.2.1) 
(3.2.2) 



Together with the fact that EL[x] V ^EL[x], this will imply that A[x] AA[x<], i.e., A[x] is true. 



3.2. Developing the theory 



47 



We can prove statement EXH by NIND on x: {EL[e\ A[e\) A {-^EL[£\ A[e<\) is trivially 
true since A[e\ is true by assumption, while 

{EL[xi\ A[xi]) A {^EL[xi] A[{xi)<]) ^ {^EL[x\ A[xi]) A {EL[x] A[x]) 

is true since the second conjunct is true by the induction hypothesis, and so is {^EL[x]^A[x<]), 
which, together with the assumption that ^[x]^^[xji], implies that ^[3;<] — >^[2;< • ji] ^^[xi]. 
The same reasoning applies to statement 13.2.21 which concludes the proof. □ 

Note that this proof can easily be modified to get a similar derived rule for A[x] — > ^[ixj], and 
it can easily be extended to cover other variations of Theorem 13.2.11 

Next, we want to define simultaneous NIND on two variables. Before we can do this, we 
need to define a few functions and prove their basic properties. 

Definition 3.2.5 (L) \b = [Xxy.y7^^{e,\>y>x))] (R) rb = [Xxy.y7^^{e,{x<iy<y)] 
Definition 3.2.6 (L) Ic = [Xxy.x < (y > x)] (R) rc = [Xxy. {x <iy)\> x] 
Definition 3.2.7 m\n^ = [Xxy.x <iy?^^ {x,y)] max^ = [Xxy.x \> yl^^ {x,y)] 
Claim 3.2.54 

1. (L) z \> yx = z \> y ■ {z <\ y) \> X (R) xy<\z = x<\{y\>z)-y<\z 

2. (L) yO X = \b{x,y) ■ y \> X (R) x Oy = x < y ■ rb{x,y) 

3. (L) y \> {{x <\ y) > x) = e (R) (x <i {y \> x)) <i y = e 

4. (L) \c{rc{x,y),y) = rc{x,y) (R) rc{\c{x , y) , y) = \c{x , y) 

5. (L) \y\>x) = y{>x'!^^ [e,{x<i>{y[>x)y) (R) {x <iyy = x 'Qyl^^ [e,\{x <iy)<> x)) 

6. (L) \c{x , yi) = \c{x , y) ■ \b{x , yi) (R) rc{x , iy) = rb{x , iy) ■ rc{x , y) 

7. (L) \c{x,y) ■ y > X = X (R) x = x <\ y ■ rc{x,y) 

Now, we can state and prove a derived rule for simultaneous NIND on two variables. 

Derived Rule 3.2.3 

(LL) A[e,y],A[x,e],A[x,y]^A[Ox,Oy]AA[Ox,ly]AA[lx,Oy]AA[lx,ly] h A 

{LR) A[e,y],A[x,e],A[x,y]^ A[Qx,y^]AA[Qx,yl]AA[lx,yQ\AA[lx,yl\ h A 

(RL) A[e,y],A[x,e],A[x,y]^A[xO,Oy]AA[xO,ly]AA[xl,Oy]AA[xl,ly] h A 

(RR) A[e,y],A[x,£],A[x,y]^A[xO,yO]AA[xO,yl]AA[xl,yO]AA[xl,yl] h A 
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Proof (We will prove only (RR), the other cases being almost identical.) Under the given 
assumptions, we will prove A[xl ■ \c{xr, z),yL ■ \c{yR,z)] by NIND on z, where 



Base case: A[xl- \c{xji, e),yL- lc(yR, e)] = A[xl, Vl]- By the definition of min^, we know that 
m\n^{x, y) = x\J min^(x, y) = y, which means that xl = yL = £, which implies (^[a;^, y^] ^ 
A[xl,£]) V {A[xL,yL] ^ ^[SjZ/l])) so we know that A[xL,yL] is true by the assumptions. 

Induction Step: we have that 

A[xL ■ \c{xR, zi), yL ■ \c{yR, zi)] = A[xl ■ \c{xr, z) ■ \b{xR, zi),yL ■ \c{yR, z) ■ \b{yR, zi)], 

which follows directly from the induction hypothesis by the assumptions. 

Finally, we know that A[xl ■ \c{xr, min^(a;, y)), yL ■ \c{yR, min^(a;, y))] = A[xl ■ xr, yL ■ yR] = 
A [x, y], which completes the proof. □ 

Note that this rule, and its proof, can easily be extended to more than two variables, giving us 
a very useful form of NIND on many variables. 

On propositional reasoning 

Now, we will show how to formalize propositional connectives in Ti. (The definitions are 
identical to those for Li, and we use "a; ?^ {y,xy' instead of the more formal "?^(a;, y, 2;)" .) 



Definition 3.2.8 

1. 7'' = [Xxyz.xl {z,z,y)] 

2. [Ax. X?'' (1,0)] 

3. = [Xx.x?'' (0,1)] 

4. A«= [Xxy-x?"" {^''y,0)] 

5. V« = [Xxy.x?'' (l,«^j/)] 

6. [Xxy.x?'' (f^^y,!)] 

7. ^^ = [Xxy.xr{^^y,^^y)] 

8. = [Axy. X ?«(-«?/, !^«y)] 



The properties of "?" already proven immediately extend to ?^ in the obvious way, and the 
following theorem follows directly from these properties. 



XL = X <\ min^(x, y) 
yL = y< mm^{x,y) 



Xr = Yc{x, m\n^{x, y)) 
Vr = rc(y,min^(x,y)) 
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Theorem 3.2.6 



1. ss^x = 1 V 



= 



2. Ri^^'^X = Ki^X 



4. X A'^y = 1^ {^'^x = 1 A R^^'y 



1) 



5. X y = 1 ^ (rs^x = 1 V Ri^y 



1) 



6. X -^^ y = l^ {k.^x = 1 — > pa^y = 1) 



7. a; y = 1 ^ (si^x = 1 <-> «^y = 1) 

8. X y = 1 (r^^x = 10 f^^'y = 1) 

This theorem gives us direct proofs of the usual properties of the defined connectives, from the 
corresponding properties of the connectives in Ti, and it allows us to introduce the following 
notation: we will write "t" instead of "i = 1" for Ti-terms t. (For example, we could state that 
"«^x -.^x" is a theorem.) 

On variations of TRN 

To define functions by "simple" TRN, we will use STRN[5', h\ as shorthand for 



The following property is then a direct consequence of the axiom for TRN. 
Claim 3.2.55 

STRN[g, h]{x, y) = x< (y(x, y), ^(x, y, STR%, h]{x<, y), STm[g, ^](^x, y))) 
On "AND" and "OR" 

Definition 3.2.9 AND = STRN[Ax.x, Axi;^^^.?;^ A^u,.] 
Definition 3.2.10 OR = STRN[Ax.x, Axv^u^.u^ v^] 

We can use TIND to prove the following simple theorem (we give the proof here to illustrate 
the use of TIND). 

Theorem 3.2.7 AND(ix) = x (e, 1) 



[Axy . TRN[Ax2;y . g{x, y), XxzyvgVr ■ h{x, y, vg, Vr), Xz. z, Xz. z]{x, e, y)] . 
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Proof By TIND on x: AND(ie) = AND(e) = e = e l^"- (e, 1), AND(ii) = AND(l) = 
1 = AND(ix) = AND((ia;)^) A« AND(^(ix)) = AND(i(x^)) A« AND(i(^x)) = 

x< (e, 1) A« ►x (e, 1) = 1 A« 1 = 1. □ 

Similarly, we can prove that OR(ox) = x (e,0). 

Now, we want to prove some more basic properties of AND and OR, the most important 
of which being AND(xy) = AND(x) A^ AND(y) (and similarly for OR). This property would 
naturally be proved using NIND since it involves the concatenation of two variables, but AND 
is defined by TRN which makes it more natural to use TIND. In fact, we will use TIND to 
prove the property but because of the messy interaction between concatenation recursion and 
tree recursion, the proof will unfortunately not be as simple as one might expect. 

Before we get started, note that it is a simple matter to extend Derived Rule I3.2T2] to give 
us rules similar to the following ones. 

Derived Rule 3.2.4 ^[0], y / e A A[y] A[yQ] A A[yl] K x / e ^ A[x\ 
Derived Rule 3.2.5 

^[00], A[01], A[10], A[000], . . . , ^[111], ^ e A A[y<\ A A[^y] ^ A[y] 

h x/eAx/OAx/1^ A[x] 

These rules can then be used to prove the following claim and theorem. 

Lemma 3.2.56 

1. (L) x7^eAx/0Ax/l^ {>x ■ j)-^ = >{x<) ■ Vx 
(R) x/eAx/OAx/1^ ►(>x • j) = »-x ■ j 

2. (L) ^xA«AND(>x-i) = AND(x) A^j (R) j A« AND(x) = AND(j • x<) A« x' 

3. 'x A« AND(>x) = AND(x) = AND(x<) A^ x' for x / e, 0, 1 

Theorem 3.2.8 AND(xy) = AND(x) A^ AND(y) forx,y/e 

Proof By Lemma 13.2.561 and by Derived Rule [3?2^ on y: AND(xi) = AND(x) A^ i = 
AND(x)A«AND(i), AND(x(yi)) = AND(xy)A«i = AND(x) A«AND(y) A^i = AND(x) A«AND(yi). 
□ 

Note that a similar lemma and theorem can be used to show OR(xy) = OR(x) OR{y) for 
x,yy^e. 
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On generalizations of CRN — part I 

Now, we will define simultaneous concatenation recursion on notation for many variables, prove 
its basic properties, and define a few more useful functions based on this generalized CRN. But 
first, we must prove a number of technical lemmas. 

Theorem 3.2.9 jX = jy^x>y = £ = x<y 

Claim 3.2.57 For f = rCRN[h], 

1- jifix,y)) =jx 

2. f{x,y) <z = f{x< z,y) 

3. \b{f{x,y),z) = x> zl^'^ {{0-h{\c{x,z),y)y,£) forx,z^£ 

4. \c{f{x,y),z) = f{\c{x,z),y) 

(A similar claim can be proved about £CRN.) 

Definition 3.2.11 (L) \pj = [Xxy.j{y <ix) ■ x] (R) rpj = [Xxy.x ■ j{x > y)] 
Definition 3.2.12 

max^ = [Ax. x] 
max^^]^ = [Axxjfc. max^(x, max^(a;fc))] 

Lemma 3.2.58 

1. (L) y > x = X > yl^^ {y > x,£) (R) x < y = x > y {x < y,£) 

2. (L) (y < {x t> y)) t> X = y t> X (R) x < {{y < x) :> y) = x < y 

Claim 3.2.59 

1. (L) lpj(x,max^(a;,y)) = lpj(x,y) (R) rpj(x, max^(x, y)) = rpj(x, y) 

2. (L) \h{\pf^{xi,max^{xi,yj)),max''{xi,yj)) = i 
(R) rb(rpo(ix, max^(ix, jy)),max^(ix,jy)) =i 

(Note that this can easily be generalized to more than two variables.) 
Now, we are ready to define £CRN^ and rCRN^. 
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Definition 3.2.13 

^CRNm[/i] = Xxrny.£CRN[Xzxrny-h{rc{xi,z), . . . ,rc{xm,z),y)] 

(max^(xr„), lpo(xi, max^(xm)), . . . , \po{xm, max^(xm)), y) 

rCRNm[h] = Xxrny-rCRN[XzXrny-h(\c{xi,z), . . . ,\c{xrn,z),y)] 

(max^(xm), rpo(a;i, max^(xm)), . . . , rpo(xm, max^(xm)), y) 

A simple application of Derived Rule 13.2.31 together with Claim I3.2.59[ both generalized to the 
m variables xi, . . . , Xm, suffices to show the following basic theorem about ^CRN^ and rCRN^. 

Theorem 3.2.10 

iCRNm[h]{xi, ... ,Xm,y) 

= Xl Xm'^^^ (^e/(/i(lpo(xi, max^(fm)), . . . , lpo(a;m, max^(xm)), y) • O) 

■ £CRNm[h]{>\pQ{xi, max^(xm)), . . . , >\pQ{xm, max^(fm)),y 
rCRNm[h]{xi, . . .,Xm,y) 

= Xl rcm, (^e,rCRNm[h]{rpQ{xi, max^(fm))<, . . . , rpo(xm, max^(xm))<, y) 

• (0 • /i(rpo(xi, max^(f„)), . . . , rpo(xm, max^(f„)), y))' 

Using ^CRNm, we can now define some useful functions and prove their basic properties 

Definition 3.2.14 

1. not« = £CRN[Ax.^«Cx)] 

2. and^ = £CRN^ [Ax^ . (^xi) A« • • • A« (^x„)] 

3. = iCRNm [Xxm . Cxi) • • • V« (^x„)] 

4. xoC = ^CRN^ [Ax„ . (^xi) • • • e« 

5. ifF^ = ^CRN^ [Ax„ . iCxi) ^« (^X2)) A« • • • A« ^« 
Claim 3.2.60 

1. and^(xiii, . . . , Xmim) = and^(xm) • and^(i'm) 

2. jxi = ■■■ = jXm A j-yi = ■ ■ ■ = jym and^(xi?/i, . . . , x^ym) = and^(xm) • and^(ym 

3. and^(xm)< = and^(xi<, . . . ,x„<) 

4. and^(xm) <iy = and^(xi < y, . . . , x^ < y) 
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And similarly for or^, iff^, and not^. We can now prove a theorem relating the functions AND 
and OR to each other. 

Theorem 3.2.11 ^«AND(x) = OR(not«(x)) and ^«OR(x) = AND(not^(2;)) for x ^ e 
On generalizations of CRN — part II 

Finally, we are ready to generalize CRN to operate on blocks of bits instead of single bits. For 
technical reasons to be discussed below, this will be done only for variable-length blocks of bits 
whose lengths are powers of two. 

In order to get such a version of CRN, we first need to define a length-division function. 
Ideally, we would like to define a function div^(x,y) whose length would be equal to [|a;[/[y|J, 
but this seems to be impossible in Ti. Instead, we can define a function powdiv^(x, y) whose 
length is equal to [jx|/2l^^^ '^'^ J , i.e., the function divides the length of x by the smallest power 
of 2 larger than or equal to the length of y. Luckily, this will be sufficient for our purposes, as 
will be seen in Chapter [5l 

The first functions we define are a function that returns a string whose length is the smallest 
power of two larger than or equal to the length of its input, and a function that tests whether 
or not the length of its input is a power of two. 

Definition 3.2.15 pow^ = STRN[i, XyviVr-Vr ■ Vr] 

Definition 3.2.16 ispow^ = STRN[Ay.e, Ayw^v^.y 

(Note that ispow^ returns e if the length of its input is a power of two and 1 otherwise.) The 
basic properties of pow' and ispow^ are now easy to prove by TIND. 

Claim 3.2.61 

1. ►pow^(y) = pow^(^y) = pow^(2/)-^ (for y^e, 0,1) 

2. ipow^(y) = pow^(iy) = pow^(y) 

3. pow^(pow'(y)) = pow^(y) 

4. pow^(y) \> y = e 

5. ispow^(pow^(y)) = e 

Before we can define the length-division function, we need to define a "length multiplication" 
function (this is just the "smash" function). 

Definition 3.2.17 # = STRN[Xxy.x?^^ {e,y),Xxyv£Vr.Vi ■ Vr] 
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These properties of # can then be proven with simple apphcations of TIND and NIND. 

Claim 3.2.62 

1. e#y = e = x#£ 

2. (L) ,{xi#y) = i{x#y)-iy (R) ^{x # yi) = i{x # y) ■ ix 

3. (L) ,{{x-y)#z) = i{x#z)- i{y # z) (R) i(x # (y • z)) = i(x # y) ■ i(x # z) 

4. ^{x#y)=i{y#x) 

Now, we can define the powdiv^ function, and a corresponding powmod^ function. 

Definition 3.2.18 

powdiv^ = TRN[Xyx.y (e, ix), Xyxv^Vr-Vr, -4, m] 
powmod^ = [Xxy. (pow^(y) # powdiv^(x, y)) > ix] 

Straightforward applications of TIND then prove these basic properties. 

Claim 3.2.63 

1. powdiv^(a;,y) = powdiv^(ia;, ij/) 

2. powdiv^(a;,y) = powdiv^(x, pow^(j/)) 

3. x> pow^(y) 7^ £ — > powdiv^(a;, y) = £ 

4. powdiv^(pow^(y) #z,y) = y (e, iz) 

And properties of powmod^ follow directly from the properties of powdiv^. 

Corollary 3.2.64 

1. powmod^(a;, y) = powmod^(ix, ly) 

2. powmod^(a;, y) = powmod^(x, pow^(y)) 

3. x> pow^(y) ^ s —>■ powmod^(a;, y) = ix 

4. powmod^(pow^(y) # z,y) = e 

We need just a few more technical lemmas about powdiv^ and powmod^ before we can define 
generalized CRN and prove its properties. 
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Claim 3.2.65 

1. x\> (pow^(y) # powdiv^(x, y)) = e 

2. y / e ^ powmod^(x, y) > pow'(y) / e 

3. y ^ e ^ powdiv^(xl, y) = powdiv^(x, y) • ((powmod^(x, y) • 1) > pow^(y) (1, e)) 
y ^ e ^ powmod^(xl, y) = (powmod^(x, y) ■ \) \> pow^(j/) (e, powmod^(x, y) • l) 

4. powdiv^((pow^(y) ^ z) ■ x,y) = y (e, iz) • powdiv^(x, y) A 
powmod^((pow^(i/) #z) • = powmod^(3;, y) 

5. y e A X \> pow^(y) = e ^ powdiv^(x,y) = powdiv^(x < pow^(y),y) • 1 
y / e A X 1> pow'(y) = e ^ powmod^(3;, y) = powmod^(x <] pow'(y), y) 

At last, we are ready to define the generalized versions of ^CRN and rCRN which we will 
name "^powCRN" and "rpowCRN". Functions defined by ^powCRN or rpowCRN take two 
extra parameters u, v as input, and essentially perform CRN on their first input by replacing 
blocks of |pow^(n)| bits with blocks of |pow^(v)| bits. Just like (2^^', 2^)-CRN in Chapter [21 we 
will simulate these generalized forms of CRN by using powdiv^ and powmod^ to extract the 
correct substring of the first input to pass to h and to output the correct bits of h in sequence. 
Intuitively, £powCRN[(7, h] and rpowCRN[5(, h] will behave as follows (for all strings z such that 
l^l = |pow^(n)|). 

^powCRN[(7, u, f , y) = g{x,u,v,y) (if \x\ < |pow^('u)|) 
£powCRN[y, h](z ■ x, u, v, y) = \aQ(^h{z ■ x, u, v, y), pow^(t>)) • ipowCKN[g, h]{x, u, v, y) 

rpowCRN[y, /i](x, M, y) = g{x,u,v,y) (if |x| < |pow^(u)|) 
rpowCRN[y, h]{x ■ z,u,v,y) = rpowCRN[y, h]{x,u,v,y) ■ rao (/i(x • z, u, f , y) , pow^(t>)) 

Where we have used the functions \aj and raj, whose definitions and basic properties (easily 
proved by Derived Rule 13.2.3]) appear below. 

Definition 3.2.19 

(L) \aj=[Xxy.j{y<\x)-{{x<y)\>x)] (R) raj = [Xxy . {x < {y \> x)) ■ j{x \> y)] 
Claim 3.2.66 



1. (L) i(laj(x,y)) = ly (R) ly = i{raj{x,y)) 

2. (L) \c{\aj{x,y),y) = \aj{x,y) (R) raj{x,y) = rc{raj{x,y),y) 
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Definition 3.2.20 

ipowCRN[g,h] = Xxuvy.lCKN Xzxuvy. 

rb^lao ^/i(rc(x, (pow^(n) # (1 • powdiv^(>2;, i;))) • powmod^(x, u)) , n, f , y) , 
pow^(w)^ , 1 • powmod^(>z, v)^ {po\N^{v) # powdiv^(j;, u), x, u, v, y) 

■ g[rc{x, powmod^(x, u)), u, v, y) 
rpowCRN[(j(, /i] = Axuvy . 5(lc(x, powmod^(x, u)), M, t>, y) • rCRN Xzxuvy. 

lb^rao(^/i(lc(a;, powmod^(x,n) • (pow^(ti) # (powdiv^(2;<, f ) • 1))) , -u, u, y) , 
pow^(i;)^ , powmod^(z<, v) ■ 1^ (pow^(u) # powdiv^(x, u),x, u, v, y) 

Theorem 3.2.12 For u^^ e and v ^ e, 
£powCRN[g, h] {x, u, v, y) = 

x D> pow^(ii) ^lao(/i(x, u, y), pow^(t!)) • ipowCRN[g, /i](pow^(ii) > x, u, v, y),g{x, u, v, y)^ , 
rpowCRN[g, h](x, u, v, y) = 

x > pow^(u) ( rpowCRN[g, h]{x < pow^{u),u, v, y) ■ rao(/i(x, u, v, y), pow^(t;)), y(x, u, v,y)). 



Proof We prove the theorem for rpowCRN only, the case for ^powCRN being almost iden- 
tical. To start with, if x > pow^(ti) ^ e, then Claim [H.2.631 and Corollary 13.2.641 give us the 
result immediately since powdiv^(x,u) = e and powmod^(2;, n) = ix. 

Next, suppose that x \> pow^(ti) = e. Then, Claim [3^2.651 implies that powdiv^(x,u) ^ e, 
which shows that pow^(t;) # powdiv^(2;, u) = {po\N^{v) # powdiv^(x, u)<) ■ po\N^{v) = {po\N^{v) # 
powdiv^(x<lpow^(ti),n))-pow^(i;). The following facts are then direct consequences of preceding 
claims, and hold for all strings z such that z ^ e A z < pow^{v) = e: 

• lc(x, y ■ z) = lc(x, y) • lc(y \> x, z) (easy to prove by NIND on z), 

• powmod^((pow^(u) # powdiv^(a;, n)<) • iz<, pow^(?;)) • 1 = powmod^(i2:<, pow^(i))) • 1 = 
iz<-l = iz, 



• Ic^x, powmod^(x, u) • ^pow^(u) # (powdiv^((pow^(t)) # powdiv^(x, u)<) • z<,v) ■ 1 
lc(x, powmod^(x, u) ■ (pow^(u) # (powdiv^(x, u)< ■ 1))) = lc(x, ix) = x. 

These facts can be used to prove by NIND on z that 

X > pow^(u) = e A z <i po\N^{v) = e ^ f[{po\N^{v) # powdiv^(x, u)<) ■ iz, x, u, v, y) = 

/(pow^(t)) # powdiv^(x < pow^(u), n), X, u, v, y) • rc(rao(/i(x, u, v, y), pow^('t;)), z) 

(where we use "/" to denote the function defined by rCRN in the definition of rpowCRN), and 
putting z = pow^(v) in this last fact gives us the theorem. □ 
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3.2.3 Numerical definitions and theorems 

In this section, we will give definitions for numerical predicates and functions (i.e., ones that 
treat their string arguments as encoding binary numbers) and prove their properties. 

On "=~" and "<~" 

The definitions of "=~" and inside Ti are the same as in Li. Intuitively, x y if the 

two strings are equal when padded on the left with O's to the same length. 

Definition 3.2.21 =~= [Ax?/.AND(1 • ifff(x,y))] 

In a similar way, x <^ y if there is a bit position where x has a 0, y has a 1, and the portions 
of X and y to the left of that position are numerically equal. 

Definition 3.2.22 

<~= [Axy. 0R(rCRN2[Axy. (x' y') A« (x< =~ 2/<)](lpo(^, y), IPo(y, x)))] 

(Where we used "i <^ j" as shorthand for "-i^i j" .) 

Now, we prove basic properties of the two predicates just defined. A simple NIND suffices 
to show the following theorem. 

Theorem 3.2.13 

1. X =^ e -f-^ X = qx 

2. X y^ lpo(x, y) = lpo(y, x) 

This immediately implies that "='^" is an equivalence relation. If we define =^= [Xxy. {x [> y) ■ 
(x <l y) (e, 1)] and =^= [Xxy. -i^(x =^ y) x =~ y], then the theorem we just proved imme- 
diately implies that x =^ y ^ x = y. Together with the facts about propositional connectives 
proved in Theorem 13.2.61 this means that for any formula A of Ti , there exists a term A of Ti 
such that Ti can prove A^ A (with our usual convention whereby "A" stands for the formula 
A = l). 

Now, we prove more properties of =^ and <''^. By Theorem 13.2.101 we have the following 
two theorems. 

Claim 3.2.67 

1. xO yO ^ X =^ y ^ xl =^ yl 

2. ^B(xO yl) 
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3. =^ yO) 



Claim 3.2.68 



1. 


xO <~ 


yO = 


X <~ 


yv« 
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=^yA« 


<^ 


0) 


= X <^ 
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2. 


xO <~ 


yl = 


x <~ 
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=~yA« 
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= x <^ 
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xl 


yO = 


X <~ 
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0) 


= X <^ 
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4. 


xl <~ 


yl = 


X <^ 
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1 <^ 


1) 


= X <^ 


y 



Simple proofs by NIND now suffice to show the following lemma. 

Claim 3.2.69 

1. -^(x <~e) 

2. -«(x <~ x) 

3. -^(e <^ 0^) 

Using the notation "x y" for y <^ x, "x <''^' y" for x <^ y x =^ y, and ">^" similarly 
defined, we have the following theorem. (We give its proof here because it is representative of 
the kind of proof that will be used for most theorems concerning numerical functions.) 

Theorem 3.2.14 

(x y A« -^(x =~ y) A^ -^(x y)) 
(-^(x <~ y) A^ X =^ y A^ -«(x >~ y)) 
(-^(x <^ y) A^ -«(x =^ y) A^ x >~ y) 

Proof By Derived Rule 13. 2. 3^ and the lemma above: When y = e, the statement of the 
theorem reduces to (x =^ e A^ -i^(x e)) e) A^ x >'^' e), which can be proved by 

regular NIND on x: e =~ e A^ ^«(e >~ e), (xO =~ e A« ^«(xO >~ e)) V« (^^(xO =^ e) A« xO >~ 
e) = (x =~ e A« -«(x >~ e)) V« (-^(x =~ e) A« x >~ e), (xl =~ e A^ -«(xl >~ e)) V« (-«(xl =~ 
e) A^ xl e) = e xM^ e =^ x = e <^ x. We can show that the statement holds when x = e 
in the same way. Next, we have four cases to consider: 

(xO <~ yO A^ -«(xO yO) A« -^(xO yO)) 
V^(-^(xO yO) A« xO =~ yO A^ -^(xO >~ yO)) 

V^(-^(xO yO) A« -«(xO =^ yO) A^ xO >~ yO) = (x <^ y A« -^(x =~ y) A« -^(x >^ y)) 

V«(-^(x <~ y) A^ X =^ y A« -^(x y)) 
V«(-^(x <~ y) A^ -«(x =^ y) A^ x >~ y) 
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and similarly for xl,yl, 

(xO <~ yl -«(xO =^ yl) -^(xO yl)) 
V^(-^(xO <~ yl) A« xO =~ yl A« -^(xO yl)) 

V^(-^(xO <~ yl) A^ -^(xO =^ yl) A^ xO yl) = (x y A^ 1 A^ -^(x >^ y)) 

V^(-^(x <~ y) A^ A^ -«(x >^ y)) 
V^(-^(x <~ y) A^ 1 A^ X y) 
= (x<~yA«-«(x>~y)) 
V^(x =~ y A^ -^(x >~ y)) 
V^(-^(x <~ y) A^ -«(x =~ y) A^ x >~ y) 

and similarly for xl, yO. □ 
Corollary 3.2.70 e x 

Corollary 3.2.71 x y = x y A^ x >^ y 

Next, from the fact that lpo(Ox,y) = lpo(x,y) V lpo(Ox,y) = • lpo(x,y) (which can easily 
be proved by cases depending on the length of x [> y) , simple proofs by Derived Rule 13.2.31 show 
the following lemma. 

Lemma 3.2.72 

1. X =^ y = Ox =^ y = X Oy = Ox =^ Oy 

2. X y = Ox Oy 

3. X <^ y = Ox y = X <~ Oy 

This lemma can be used, with a generalization of Derived Rule l3.2THl to three variables, to show 
the following theorems and their corollaries (from Theorem 13. 2. 13p . 

Theorem 3.2.15 x =~ y A y <~ z ^ x <'^' z and x y A y z ^ x >'^' z 

Corollary 3.2.73 x =~ y A y z ^ x z and x y A y >~ z ^ x >~ z 

Theorem 3.2.16 x <^ y A y <^ z ^ x z and x >^ y A y >^ z ^ x >^ z 

Corollary 3.2.74 x y A y z ^ x z and x >^ y Ay >^ z ^ x >^ z 

Corollary 3.2.75 x y A y z ^ x z and x >^ y Ay >^ z ^ x >^ z 



60 Chapter 3. The Quantifier-Free Theory Ti 

On "I -I" and "succ~" 

Now, we define the binary length function "|-|" and the numerical successor function "succ~" 
as in Li, and prove some of their basic properties. 

Definition 3.2.23 

cuss~ = £CRN[Ax. AND(l>x) ?^ {^^'x, V)] 
succ^ = [Ax. cuss^(Oa;)] 

Simple proofs by NIND show the following theorem (proving the relevant properties first for 
the auxiliary function cuss", and then for succ*^). 

Claim 3.2.76 

SUCC~(£) = 1 

succ~(xO) = 0x1 
succ^(xl) = succ^(x) • 

Using this theorem, a simple NIND will now prove the following properties. 
Claim 3.2.77 

succ~(Ox) = • succ~(x) 

succ~(lx) = 'succ-"'(x) • -^^succ^(x) • >succ^(x) 
'succ''(x) = AND(lx) 

succ~(x) = AND(lx) T'' (l • oa;,0 • >succ~(x)) 

Now, we can prove a few theorems involving succ'"* together with some of the other numerical 
functions already defined. 

Theorem 3.2.17 

1. X <~ succ~(x) 

2. X >~ y = X >~ succ~(y) 

The binary length function is defined in the same way as in Li, as follows. 

Definition 3.2.24 |-| = STRN[i, Axu^w,.. x {ve • 0,u^ • 1)] 

(To be consistent with previous notation, we will write "|x|" instead of the more formal "||(x)".) 
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Claim 3.2.78 \x\ = \jx\ 

Theorem 3.2.18 jX = jy ^ \x\ = \y\ 

Proof One direction {jx = jy ^ \x\ = \y\) is immediate from the preceding claim. The 
other is proved by TIND on y (with = M and hr = ►): |x'| = |e| —i- \x\ = e^x = £^ jx = jS, 
\x\ = |i| ^ |x| = 1 — > X = — > jX = ji, and assuming that \x<\ = \y<\ jX< = jy< and 
[►x| = \>-y\ j>-x = j>-y, we have that 

|x| = \y\^x {\xM\ ■0,\xM\-l)=y {\yM\ ■ 0, \yM\ ■ l) 

[x< > ►x = £ A > ►y = £ A |x<^| ■ = \y<\ ■ O) V 
(x-^ > ►x 7^ £ A > ►y 7^ £ A |x-^| ■ 1 = \y<\ ■ l) 
(x-^ > ►x = £Ay-^I>^y = £A jX-4 = jy<) V 
(x-^ > ►x = i A y-^ > ►y = A jx-^ = jy<) 

(where the two cases for |x-^| • = \y-4\ ■ 1 and \x-4\ ■ 1 = |y-^| ■ were not included in the 
disjunction on the second and third lines since they are known to be false). □ 

The following theorem can be proved with an easy TIND and its corollaries are immediate 
from previously proved theorems. 

Theorem 3.2.19 |xi| =~ succ^(|x|) 

Corollary 3.2.79 |x| <~ \xi\ 

Corollary 3.2.80 x >~ |y<| x >~ |y| 

On "masking" functions 

In order to define binary addition, and to prove its properties, we will need "masking" functions 
like the ones that were defined in Li. We give their definition and basic properties here. 

Definition 3.2.25 

firsto = rCRN[Ax. AND(lx<) (-V,0)] 
firsti = rCRN[Ax. OR(x<) 7" (0,x')] 

Definition 3.2.26 maskbit = [Axy. OR(andf (x, y))] 

Definition 3.2.27 delfirsti = [Ax. andg (x, not^(firsti(x)))] 
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The basic theorem below, as well as its corollary, can both be proved with a simple NIND. 
Theorem 3.2.20 



Corollary 3.2.81 firsto(x) = firsti(not^(3;)) 
On binary addition 

Before we define binary addition and prove its properties, let us make a remark about "numer- 
ical" functions. If a formula A contains only terms made up of functions / with the property 
that /(xi, . . .,Xm) = /(lpo(a;i, max^(xm)), . . . , lpo(a:;m, max^(fm))) (which happens to be the 
case for the numerical functions), then A[xi, . . . , Xm] ^ {jXi = • • • = jXm A[xi, . . . , Xm]) ■ 
Thus, we can use the following special form of Derived Rule l3.2T3] to prove any such formula A 
(the rule is stated only for two variables but can easily be extended to more). 

Derived Rule 3.2.6 A[e,e], jx = jy A A[x,y] ^ A[0x,0y] A A[Ox,ly] A A[lx,Oy] A A[lx,ly] 
h jx = jy A[x,y] 

(The conclusion of the rule can easily be proved from the antecedent by a simple application of 
Derived Rule[M31) 

Now, binary addition is defined just as in Li, as follows. 

Definition 3.2.28 carry'"* = ^CRN2 [Xxy. maskbit(andf (x, y), firsto(xorf (x, y)))] 

Definition 3.2.29 -|-~ = [Axy.xorf(carry'^'(^>y) '^i^^u)] 

(To make the notation consistent with previous usage, we will write "x -l-'^ y" instead of the 
more formal "-|-~(x, y)".) The commutativity of is a direct result of the commutativity 

of each function involved in its definition. 

Theorem 3.2.21 x y = y x 

Proving the associativity of will be slightly more complicated. First, we relate the func- 
tions -|-'^ and succ'^ through the following lemma and theorem. 

Lemma 3.2.82 



firsto(Ox) 
firsti(Ox) 



• firsti(a;) 



1 • ox 



firsto(lx) = • firsto(x) 
firsti(lx) = 1 • oa^ 



carry'^(j;0, 1) = carry'^(x, e) ■ = qxO 
carry~(xl, 1) = x l^"" (l, carry-^'(^) 1) ' l) 
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Theorem 3.2.22 x +~ 1 = x (O • succ''(x), succ~(x)) =~ succ~(x) 
Next, we can state certain facts about the carry function. 

Claim 3.2.83 For jx = jy, 

carry^(x, e) = ox 

carry''(2;, x) = x 
carry''^'(Ox, Oy) = • carry'^(x, y) 
carry''^'(lx, Oy) = Varry''(x, y) ■ carry''^(x, y) 
carry''^(lx, ly) = 1 • carry'"'(x, y) 

Note that we omitted the property carry'^(Ox, ly) = ^carry~(x, y) ■ carry~(x, y) from this theorem 
since it follows directly by the commutativity of carry"*. This will be the case for many of the 
theorems and proofs about +^ that we will now present: for the sake of brevity, we will omit 
statements and proofs that follow directly from previous ones by commutativity. The following 
claim follows directly from the corresponding properties for carry*^. 

Claim 3.2.84 For jx = jy, 

X +^ e = Ox 
X X = xO 
Ox +~ Oy = • (x +~ y) 

Ix +~ Oy = \x +~ y) ■ -«Xx +~ y) ■ >(x +^ y) 
Ix +~ ly = 1 • (x +~ y) 

Now, although we can use Claim [3^^2.841 to prove theorems about +^ by Derived Rule l3.2T6l we 
will also have need of the following theorem further on. 

Claim 3.2.85 

xO +^yO = (x +~y)-0 
xl+^yO = (x +~y)-l 
xl +~yl = >succ''(x +''y) • 

With the help of this theorem, we can now prove the following important properties of +^ 
with a version of Derived Rule 13.2.^ that concatenates bits to the right instead of to the left. 
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Theorem 3.2.23 

1. X +^ succ~(y) =~ succ~(x +^ y) 

2. x+^iy+^'z) =~(x+~?/)+~z 

3. y <^ z X +^ y <^ x +~ z 

4. X =~ y A z =~ w ^ X z y w 

5. X =^ y f\ z <^ w ^ X z <^ y +^ w 

6. X <~ y A z <~ u; ^ X +~ z <~ y +~ u; 

On iterated sums 

The last functions we need to define are iterated sums, defined as in Li using Buss's "carry-save" 
technique. 

Definition 3.2.30 



CScar = [AX1X2X3X4. CScar3(CScar3(xi,X2,X3) • 0, CSadd3(xi,X2,X3),0x4) • O] 
CSadd = [AX1X2X3X4. CSadd3(CScar3(xi,X2,X3) ■ 0, CSadd3(xi,X2,X3),0x4)] 

The following properties are a direct consequence of these definitions. 
Claim 3.2.86 

CScar3(xO, yO, zO) = CScar3(x, y,z) -0 
CScar3(xl, yO, zO) = CScar3(xO, yl, 2;0) = CScar3(xl, yO, zl) = CScar3(x, y, 2;) -0 
CScar3(xO, yl, zl) = CScar3(xl,yO, zl) = CScar3(xl, yl, zO) = Cscar3(x, y,2;) • 1 

CScar3(xl,yl, zl) = CScar3(x, y, z) • 1 
oCScar3(x, y,z) - = oCSadd3(x, y,z) = 0- omax3(x, y,z) = 0- max3(ox, oy, o^^) 
• max3(ox, oy, 0^) = CScar3(ox, oy, 02;) • = CSadd3(ox, oy, qz) 
oCScar(x, y, z, w) = oCSadd(x, y, z, w) = 00 ■ omax4(x, y, z, w) = 00 ■ max4(ox, oy, qz, qw) 
00 • max4(ox, oy, oz, ow) = CScar(ox, oy, qz, qw) = CSadd(ox, oy, oz, qw) 

We can now prove one main lemma and one main theorem about the "carry-save" addition 
functions. 



CSadds 



CScars 



^CRNa [AX1X2X3 . (('xi A^ 'x2) 0x2 A^ 'xs) ('xs A^ 'xi))] 
[AX1X2X3 .xorf (Oxi, 0x2, 0x3)] 
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Lemma 3.2.87 

(CScar3(succ''(x), y, z) ■ 0) CSadd3(succ^(x), y, z) 

=~ succ'^((CScar3(a;, y, z) ■ 0) +^ CSadd3(a;, y, z)) 

Theorem 3.2.24 CScar(x, y, z, w) +~ CSadd(x, y, z, w) x y z w 

Finally, we can define the function "sum", that adds all the bits of its argument, as in Li. 

Definition 3.2.31 

CARADD = STRN[Aa:. oa; • x, Xxv^Vr- CSc3r{v^<, ►i;^, Vr<, ►fr) • CSadd(t;^-^, >-vi, Vr<, >-Vr)\ 
CAR = [Ax. CARADD(x)^] ADD = [\x. ►CARADD(x)] 
sum = [Aa;.CAR(x) +~ADD(x)] 

The following basic properties of CARADD will be used to prove results about sum and can be 
proved easily from previous theorems. 

Claim 3.2.88 

1. CARADD(ox) =~ 

2. sum(ox) = CAR(oa;) +~ ADD(ox) =~ +~ =~ 
Theorem 3.2.25 sum(x) =~sum(x-4) +~sum(^x) 

From this theorem, it is possible to prove that sum(xy) =^ sum(x) +~sum(?/) with a sequence 
of lemmas and theorems similar to the ones used to show that AND(xy) = AND(x) AND(?/). 
In particular, we have that sum(xO) =~ sum(x) +~0 =^ sum(x) and sum(xl) =^ sum(x) +^1 =^ 
succ~(sum(x)). 

Simple proofs by NIND now show the following theorem. 

Theorem 3.2.26 sum(x) <''^ sum(ix) — \x\ 

3.3 Proving the pigeonhole principle in Ti 

In this section, we will be working with the following form of the pigeonhole principle, denoted 
PHP„(/) (or simply PHP when n and / are clear from the context): "no map / : [n + 1] — > [n] 
is injective", or equivalently "if / is a map from [n + 1] to [ra], then there exist i ^ j + 
such that f{i) = f{j)"- (Note that we are using the common notation "[n]" to represent the 
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set {1,2, . . . for any positive integer n, and in what follows, we will use the term map to 
mean a (possibly) multi-valued function.) 

Informally, the proof of PHP goes as follows: Assume for a contradiction that / is a map 
from [n + 1] to [n] and that / is injective (i.e., for every i ^ j G [n -|- 1], f{i) ^ /(j))- Define 



i.e., count(A:,£) is the number of elements in [^] mapped onto from elements in \k\ by /. Then, 
the following facts are easy to prove. 

1. count (n -|- 1,^) < £ for any \ < i <n (since there are i elements in [£]). 

2. count(l,n) > 1 (since /(I) G [n]). 

3. count(A; -|- l,n) > count(A;,n) for \ < k < n (since f{k -|- 1) must be different from 
/(I), . . . , /(/c) by the assumption that / is injective). 

Combining facts [2] and [31 we get that count(A;,n) > k for all 1 < A; < n -|- 1. But then, 
n -|- 1 < count(n -|- 1, n) < n, i.e., n -|- 1 < n, which is a contradiction. Hence, PHP is true. 

In the rest of this section, we will show how the informal proof given above can be formalized 
in Ti, in a top-down manner. Also, we adopt the following notational convention: when a 
function is defined through an auxiliary function that is of no interest in itself, the name of 
the auxiliary function will consist of the function's name spelled backwards (e.g., we will define 
below a function "map" in terms of an auxiliary function "pam"). 

3.3.1 Representation of PHP in Ti 

The first step in the formalization will be to use the machinery given above to write down 
Li-functions that define PHP. Formally, PHP = PHP„(/) depends on two parameters: n and 
/; moreover, given n, / can be described by an [n x (n + 1)] binary array whose (i, j)-th entry is 
equal to 1 if i = /(j) and otherwise, i.e., each row corresponds to one hole and each column 
to one pigeon. This is essentially the representation we will use to encode the problem. 

More precisely, given n, every bit string a can be seen as encoding an [n x (n -|- 1)] binary 
array (and therefore a partial map /" : [n + 1] ^ [n]) by either padding a on the right with O's 
or chopping off enough bits from the right of a so that its length is n • (n -|- 1), and then reading 
the array in row-major order, so that the first n + 1 bits of a (from the left) represent the first 
row of the array, and so on. For example, the string 1000110010 represents at the same time 



count(A;,^) = {x G [^] : /(y) 



X for some y G [A;] } , 



10 



the [2 X 3] binary array 



1 



the [3 X 4] binary array 



110 



etc. 



1 1 



10 
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Then, every such [n x (n + 1)] binary array a represents a partial map /" : [n + 1] ^ [n]. 

Now, given a bit string N such that n = |n|, we can define a function adj(a, n) that "adjusts" 
the length of the bit string a so that |adj(a, n)| = |n| • (|n| + 1), i.e., adj(a, n) is exactly the 
[n X {n + 1)] binary matrix encoded by a, written out in row-major order: 



Next, given a column number K and a row number L in unary, we can easily define a function 
entry that extracts a single entry (bit) of the matrix: 



Note that the value returned by this function is meaningless unless l<|K| = A;<n + l and 
1 ^ |l| = ^ < n. In order to simplify the presentation, we will implicitly assume that k and 
i fall within this range for the rest of the section, where k = |k| and i = |l| by convention 
(i.e., functions are implicitly defined by cases to be equal to e for values outside the meaningful 
range) . 

Once we have the function entry, it is easy to define functions col and row that extract 
columns or rows of the matrix, by CRN: 



So that col (a, N, k) is the k-th column of a and row(a, N, l) is the £-th row of a. Moreover, it is 
easy to prove in Ti that 



directly from the properties of CRN. 

Using these functions, we can now define two functions needed to represent PHP„(/): 



adj(a, n) = o < ((n1 # n) > a) • o(a I> (n1 # n)). 



entry(a, N, K, l) = (((n1 # L<) • K<) > adj(a, n))'. 



loc(a, n, k, l) = loc(a, N, K, L<) • entry(a, N, K, l) (for L 7^ e) 

col(a, N, k) = loc(a, N, K, n); 
wor(a, N, K, l) = wor(a, N, K<, l) • entry(a, N, K, l) (for K 7^ e) 
row(a, N, l) = wor(a, N, Nl, l). 



lc(col(a, N, k),l) = loc(a,N,K,L), 
lc(row(a, N, l), k) = wor(a, N, K, l), 



map(a, n) 



1 if /" is a map, i.e., every column of a contains at least one 1, 



otherwise; 



inj(a,N) 



1 if /" is injective, i.e., every row of a contains at most one 1, 



< 



otherwise. 
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To compute map (resp. inj), we will first define a function pam (resp. jni) that returns a bit 
string with one bit for each column (resp. row) of a indicating whether the constraint is satisfied 
or not for that column (resp. row); then, we simply take the conjunction of all the bits to get 
the answer: 

pam(a, N, x) = pam(a, N, x<) ■ OR(col(a, N, x)) (for x ^ e) 
map(a,N) = AND(pam(a, N, Nl)) , 

jni(a, N, x) = jni(a, N, x<) ■ -i^OR(delfirsti(row(a, N, x))) (for x ^ e) 
inj(a,N) = AND(jni(a, N, n)). 

Finally, we can easily define a function that represents PHP: 

php(a, n) = map(a, n) — s-^ -'''inj(a, n). 

3.3.2 The Ti-proof of PHP 

First, let us define the function count(a, N, K, l), which returns the number of elements from 
[£] that are mapped onto by elements from [k] according to f2- We do this by first defining 
tnuoc(a, N, K, l), which returns a string of £ bits, one for each of the first £ rows of a, where bit 
j is set to 1 iff row j contains at least one 1 in the first k columns: 

tnuoc(a, N, K, l) = tnuoc(a, N, K, L<) • OR(wor(a, N, K, l)) (for L e) 

Then, count(a, N, K, l) = sum (tnuoc(a, N, K, l)) . We can depict the situation as follows, where 
we have represented the submatrix of a consisting of the first k columns for each of the first i 
rows, and where the value of tnuoc(a, N, K, l) can be read bit by bit, one for each row: 

k 

' ^ tnuoc(a, n,k,l) : 

1 ••• 1 — ^ 1 = OR(wor(a,N,K,l)) 

••• — ^ = OR(wor(a,N,K,ll)) 

^ ••• 1 — > 1 = OR(wor(a,N,K,L)) 

Now, we give a formalization in T\ of the high-level proof of PHP outlined at the beginning 
of this section. To make the notation easier to read, all theorems are conditional to the fact 
that k and i arc within meaningful range. 

Recall the general outline of the proof: under the assumption that -i^php(a, n), i.e., that 
map(a, n) and inj(a, n), it is possible to prove the following two facts. 

Fact 3.3.1 count(a, N, Nl, l) <~ t (where i = |l|, by convention) 
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Fact 3.3.2 k <~ count(a, n, k, n) (where k = |k|, by convention) 

Then, we get that |n1| <~ count(a, n, n1, n) |n|, so that |n1| <~ |n| (by transitivity of <~). 
But since we know that -i^(|a;l| <^ we get php(a,N) by contradiction. 
Now, we can prove fact 13.3.11 

count(a, n, n1, l) = sum(tnuoc(a, n, n1, l)) |tnuoc(a, n, n1, l)| <^ |l| = i 

since sum(a;) <'^' |x| for any string x and |rCRN[/i](x, y)| = \x\ for any Li-function h and any 
strings x, y. 

To prove fact 13.3.21 we will first show that it is possible to prove the following two facts 
(corresponding to facts [5] and [3] in the informal proof). 

Fact 3.3.3 count(a, n, e, n) =^ 

Fact 3.3.4 count(a, n, k, n) >''^' count(a, n, k<, n) 

Then, we can use NIND to show that count(a, n, K, n) |k|: count(a, n, e, n) |e| by 
fact 13.3.3] and count(a, n, K, n) >~ count(a, N, K<, n) >'^' |k<| by fact 13.3.4] and the induction 
hypothesis, so that count(a, N, K, n) >~ |k|. 

Next, to prove fact 13.3.51 we use NIND together with the fact that wor(a, N,e,L) = e (by 
definition) to conclude that tnuoc(a, N, e, l) = qL: tnuoc(a, n, e, e) = e = qe, and assuming 
that tnuoc(a, N, e, L<) = o(l<), then tnuoc(a, N, e, l) = tnuoc(a, N, e, L<) • OR(wor(a, N, e, l)) = 
o(l<) • OR(e) = o(l<) • = o(l). Finally, we use the fact that sum(ox) =^ for any string x to 
conclude that fact 13.3.31 holds. 

To prove fact 13. 3.-1] we have to show that sum(tnuoc(a, N, K, n)) sum (tnuoc(a, N, K<, n)) . 
Intuitively, this will be true iff tnuoc(a, N, K, n) contains more I's than tnuoc(a, N, K<, n). For- 
mally, for any two strings x and y, the term AND(or|(not^(x), y)) expresses the fact that y 
has a 1 in every position where x has a 1, and the term OR(andf (not^(x), y)) expresses the 
fact that there is a position where x has a but y has a 1. Now, since we can prove that 
AND(orf(not^(a;),y)) OR(andf (not^(a;), y)) ^^sum(x) <^'sum(y) (the proof is given in Ap- 
pendix |A]) , we only need to show the following facts to complete the proof of fact 13.3.41 

Fact 3.3.5 AND(orf (not^(tnuoc(a, N, K<, n)), tnuoc(a, N, K, n))) 

Fact 3.3.6 OR(andf (not^(tnuoc(a, n,k<, N)),tnuoc(a, n,k, n))) 

It is relatively easy to prove fact 13.3.5] by NIND on the last argument: tnuoc(a, N, K<, 1) = 
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OR(wor(a, N, K<, 1)) and tnuoc(a, N, K, 1) = OR(wor(a, N, K, 1)) so 

AND (orf (not^(tnuoc(a, N, K<, e)), tnuoc(a, N, K, e))) 
= ^^OR(wor(a, N, K<, 1)) OR(wor(a, N, K, 1)) 
= -■^OR(wor(a, N, K<, l)) OR(wor(a, N, K<, l) • entry(a, N, K, l)) 
= -■^OR(wor(a, N, K<, l)) V*^ OR(wor(a, N, K<, l)) entry(a, N, K, l) 
= 1 entry(a, N, K, l) = 1. 

Also, by the definition of tnuoc, we get that 

AND(orf (not^(tnuoc(a, N, K<, l)), tnuoc(a, N, K, l))) 
= AND^orf ^not^(tnuoc(a, N, K<, L<) • OR(wor(a, N, K<, l))) , 

tnuoc(a, N, K, L<) • OR(wor(a, N, K, l))^ ^ 
= AND(orf (not^(tnuoc(a, N, K<, L<)), tnuoc(a, N, K, L<))) 

a"" (^''OR(wor(a, N, K<, l)) V"" OR(wor(a, N, K, l))) 
= 1 (^^OR(wor(a, N, K<, l)) OR(wor(a, N, K, l))) = 1 

(where the third equality holds by the induction hypothesis). 

The proof of fact 13.3.61 is the most involved so far. First, by the definition of tnuoc and 
properties of CRN, we know that 

tnuoc(a, N, K, n) = tnuoc(a, N, K, L<) • OR(wor(a, N, K, l)) • (l > tnuoc(a, N, K, n)) . 

Because OR(xy) = OR(x) 0R(2/) and andf (xy, zwz) = andf(a;,i(;) • andf(2/,z) when |x| = \w\ 
and \y\ = \z\, we get easy proofs in Ti that 

-.^OR(wor(a,N,K<,L)) A^ OR(wor(a, N, K, l))^^ 
OR(andf (not^(tnuoc(a, N, K<, n)), tnuoc(a, N, K, n))) . 

Hence, we can prove fact 13.3.61 by showing that there must exist some value L for which 
-i^OR(wor(a, N, K<, l)) A^ OR(wor(a, n, k, l)) . Now, we can prove that -■^OR(delfirsti(x)) A^ 
\b{x,y) -i^OR(lc(x, 2/<)) (see Appendix E]) . and since wor(a, N, K<, l) = lc(row(a, N, l), K<) 
and -i^OR(delfirsti(row(a, N, l))) by the assumption that inj(a,N), we only need to show that 
there is some L for which lb (row (a, N, l), k) , which is equivalent to showing that there is a value 
of L for which entry(a, N, K, l). 

Unfortunately, we do not have quantifiers to reason with so to show the existence of L, 
we have to construct it explicitly, i.e., to define a function pos(a, n,k) that gives the value 
of L. Because all functions definable in Ti are length-determined, pos will have to return 
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a hitmask to the position of L, and this bitmask cannot be used directly with the current 
definition of row to prove what we need. So, we wih define an alternate function mrow whose 
last argument is a bitmask instead of a unary string and for which we can show row(a, n, l) = 
mrow(a, n, (1 • o(l > n1))<): 

mwor(a, n, k, m) = mwor(a, N, K<, m) ■ maskbit^'(col(a, n, k), m) (for K 7^ e) 
mrow(a, n, m) = mwor(a, N, n1, m) 

With this definition, pes can easily be defined as pos(a, n,k) = firsti(col(a, N, k)). By the 
assumption that map(a, n), we know that OR(col(a, N, k)) and since we can prove that OR(a;)^^ 
maskbit''^'(x, firsti(x)) in Ti (see Appendix [X]) . we get that maskbit'^ (col(o, N, k), pos(a, N, k)) , 
which implies immediately that lb(mrow(a, N, pos(a, N, k)), k) . 

Now, because row and mrow are both defined by CRN on the same parameter K, it is 
sufficient to show that entry(o, N, K, l) = maskbit'^ (col(a, N, k), (1 • o(l > n1))<) in order to 
prove that row(a,N,L) = mrow(a,N, (1 • o(l > n1))<). And because we can prove in Ti that 
\h^{x,y) = maskbit''^'(x, (1 • o(y > x\))<) (see Appendix lAj) . this is equivalent to showing that 
entry(a, N, K, l) = lb(col(a, N, k), l) = loc(a, N, K, l)', a fact which is immediate by the definition 
of loc. 

Finally, we can redefine tnuoc using mwor instead of wor, as follows: 

mtnuoc(a, N, K, l) = mtnuoc(a, N, K, l<) • OR(mwor(o, N, K, (1 • o(l [> n1))<)) (for l 7^ e) 

and using the reasoning given above, it is possible to prove that 

OR(andf (not^(mtnuoc(a, n, k<, n)), mtnuoc(a, n, k, n))) . 

Moreover, because of the equivalence between wor and mwor given above, this implies the 
corresponding result for tnuoc, which completes the proof. 

Remark 3.3.1 Based on the Ti-proof of the pigeonhole principle just given, it should be 
possible to prove other, similar combinatorial statements in Ti. One example is Tutte's theorem, 
which states that a graph has no perfect matching iff it satisfies a certain simple form of 
decomposition. This would give an alternative proof that "perfect matching" tautologies have 
short .F-proofs, and maybe provide a more precise estimate of the size of these proofs by the 
results of Chapter HI (The "perfect matching" tautologies were first discussed in a paper by 
Impagliazzo, Pitassi, and Urquhart [21j, where it was shown that they have polysize J-'-proofs — 
note that those proofs were non-uniform, unlike the proofs we would obtain through Ti.) 
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Theorems of Ti Have Polysize 
^-proofs 

For every term t of Ti with free variables xi, . . . , x^, we define a length function lent{mi, . . . , m^) 
that gives the exact length of t as a function of the lengths of xi, . . . ,Xk (this function is well- 
defined because functions in Li are length-determined). Then, we define a family of propo- 
sitional term formulas (t)™, . . . , that describe the bits of t in terms of the bits of 

xi, . . . ,Xk (where (t)]" describes the leftmost bit of t), i.e., given any truth- value assignment to 
the atoms representing the bits of xi, . . . , x„, the truth value of {t)^ represents the correct value 
for bit number i of term t. Finally, for any formula ^ of Ti, we define a family of propositional 
translations {AJ"^, where fh lists the lengths of all free variables in A, and show that there are 
short J^-proofs of {Af^ whenever ^ is a theorem of Ti. 

4.1 Length functions 

The length functions are defined inductively as follows (where "sg" is the signum function). 





= m 








len/(i,,...,t^)(m) 


= lenj(^^_...^^.^) 


(lenii(mi), . 


. . ,lenij^(mfc))Q 






( where xi, . 


. . , Xfc occur 


in none of ti, . . 


■ ,tk ) 


leue 


= 


leno = 1 


leni = 


1 




= m 




len^a;(m) = 


m 


lenj.^(m) 


= [m/2\ 




len^a;(m) = 


[m/2] 




= m — n 




lenj^i>a; (n,m) = 


m — n 



Where rhi represents the lengths of the variables that occur in U . 
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len^.j;(m,n) 

len[A^.t](j?)(n) 
len£CRN[h]{x,y)("i,^) 
^^^TRN[g,h,he,hr] {x,z,y) i^, p, ft) 



m + n 

if m = then n else max{pQ,pi} 



m 



len. 



rCRN[h](x,y)(."^''^J = 



if m < 1 then len^j-j.^^ y^(m,p, n) else 
^<i^h{x,z,y,vo,vi) (jn, p, n, leT^TKf^[g,h,he,hr](x-^,he(z),y) ' 

^enTRNlg,h,he,hr]i>'x,hr(z),y) i^i ^ 



4.2 Term formulas 



To every variable x of Ti are associated prepositional atoms (x)™ 



For other terms 



of Ti, the term formulas are defined inductively as follows. (When subscript i is used without 
specifying its range in the definition of (t)^", it is implicitly assumed that 1 < i < lent(m).) 



(/(ti,...,tfc))r = (/(xi,...,x,)): 



lent^(rni),...,lent^(mfe) 



lent (mj)- 



J I ii 



l<ij <lent^ (mj ) 



( where xi 


, . . . , X/j 


occur in none of t\ 


. . . , tfc ) 




(0)i = 








= T 


{ox)T = 








= T 


{x<)T = 


(x)r 




(►x)r 


— (^)i+[m/2j 


1 . \m,n 


(x)r 




(y > x)^ 




(a; • y)i = 


f(x)r 

1 


^ if m < i 







(x?(2/,zo,.i))™'^^ 

([Af.t](y))f 
(£CRN[/i](x,y)) 



m,n 
i 



(rCRN[/i](x,y)): 



if m = 



(^(x):^A(o(zi<zo)-^o)r"'0 

/((^)™ A (o(zo < zi) ■ zi)f 'P^) if m > 

( where z does not occur in h{x,y) ) for m > 
( where z does not occur in h{x,y) ) for m > 



^ Where n' represents the lengths of the variables from y that actually occur in t[y/x] 
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{9{x,z,y))f'^''^ ifm<l 
(h[x, z, y, TRN[5t, h, hi, hr]{x<, he{z),y), 

\ m,p,n 

TKN[g, h, hi, hr]{>-x, hr{z),y)) ) if 1 < m 

/ i 

Remark 4.2.1 Note that "?" is the only primitive function symbol that has non-trivial term 
formulas (because it is the only function that depends directly on the values of its arguments), 
so that any non-trivial term formula must depend on ? in some way. 



(TRN[g,/i,/i,,/i,](x,z,y)>™'^'"= < 



4.3 Propositional translations 

The propositional translations of formulas of Ti are defined inductively as follows (where 
stands for any one of the binary propositional connectives, and iriQ and fhi represent the lengths 
of the variables that occur in A and B (or t and n), respectively.) 



It = nf 



A ^ it) 

l<i<lent(mo) 

_L 



mo 



(n)™-^ if lent (mo) = lenu{mi), 



otherwise. 



lA Br = iAr° isr' 



4.4 The simulation result 

Now, we can prove the following theorem. 

Theorem 4.4.1 If A is provable in Ti, then for any m, [A]™ has uniform polysize J--proofs. 

Proof The proof is by induction on the number of inferences in the proof of A. If A is an 
axiom, then Subsection 14.4.11 below shows that \A\^ has linear-size .F-proofs. If A is obtained 
by a derivation, then by the induction hypothesis, the propositional translations of the premises 
of the last inference all have short J^-proofs. Subsection 14.4.21 below shows that in this case 
also, lA]*" has short .F-proofs. Moreover, all these .F-proofs are uniform, in the sense that 
there exists a specific function that takes a theorem of Ti and the lengths of its variables into a 
.7-"-proof of the translation of the theorem. This function is described implicitly in the sections 
that follow, but it can be formalized in Ti itself, using techniques similar to those developed in 
Chapter [5] (we do not expect any technical difficulties in doing this but time constraints prevent 
us from working out the details). □ 
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4.4.1 Axioms 

For most axioms of the form t = u, just writing down the definitions of {t)i and {u)i is enough 
to see that the axiom is a theorem with short proofs since {t)i = {u)i. We give a more detailed 
argument only for a few axioms. 

0. The axioms for the propositional calculus can obviously be simulated by any .7^-system. 

1. (a) By reflexivity, there are linear-size J^-proofs of /X{x)^ ^ {^)T ^"-"^ variable x. 

(b) If m 7^ n, then the antecedent of the axiom translates to _L so the translation of the 
axiom is a trivial theorem. If m = n, then the commutativity of gives linear-size 

.F-proofs of ( A(^)r (y)?) - {Aiy}? ^ (^)r)- 

(c) If m 7^ n or n 7^ p, then one of the antecedents of the axiom translates to _L so the 
translation of the axiom is a trivial theorem, lim = n = p, then the transitivity of ^ 
gives linear-size .^-proofs of (( A(^)r iv)?) A ( Aiv)? ^ (^)D ) ^ ( A(^)r i^fi) ■ 

(d) An easy induction on the structure of the function symbol /, together with properties 
of is sufficient to show that there are short .^^-proofs of {A{^i)T^ ^ ivi)!'^ ^ 

• ■ • A A(^/c)r - {yk)T) -> ■ ■ ■,Xk))r'-''^' - ifiVu . . ■ ,y.))r'-'"'= when 

mi = ni, . . . , rrtfe = rik (the axiom's translation becoming a trivial theorem otherwise 
as one of the antecedents translates to _L). The size of these proofs is linear if / does 
not contain any function defined by TRN; it is polynomial otherwise (since the size 
of the term formulas can be polynomial in the lengths of the variables) . 

2. (0)i = ± and = T. 

3. (a) For alll < z < m + 0, {x ■ e)™ = (a;)f*, and for alU < z < m + n + 1, 



{x ■ yO) 



m,n 
i 



{x)Y^ Hi <m 

„. a i — m < n 
>i-m-n if n < i - m 



((^ ■ y) ■ o)r' 




m n if i < m 

{x ■ y)™'" = { ii i < m + n 

if m < i 



^ (0)j_(^+„) ifm + n<i 

(similarly for x ■ yl = {x ■ y) ■ 1). 

(b) {x-y = e]"*'" holds iff m + n = 0, and {x = e Ay = e]"*'" holds iff m = and n = 0. 

(c) fx ■ y = 0|™'"' holds iff m + n = 1 and {x)\ = _L or {y)\ = _L. Similarly for a; • y = 1. 
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4. (a) For 1 < i < m, {e > x)Y^ = {x)'^q by definition; also, for 1 < i < m — (n + 1), 
{{Oy) \> x)™''" = (a;)™^^-^ = {y \> a;)^'" = (0 > (y [> x))™"'", and the same reasoning 
applies to ((ly) > x)™'". 

(b) lenoc>£ = 0^1 = = len^, and for 1 < i < m + 1 ^ 1, (0 [> (Ox))f = (Ox)[:]ii = 

= (similarly for (0 [> {Ix))^). The same reasoning applies with 

"1 > . . . " in place of "0 [> . . . " . 

(c) leny|>^(m, n) = m — n and len^|>j^o("^) = len2;>yi(m, n) = (n + 1) — m, and m — n = 
O^m — n<0<->m<n<->m<n+l<-i>0<(n+l)— m^(n+l)— mT^O. 



5. (a) Similarly to Hal 

(b) Similarly to [4bl 

(c) Similarly toHcl 

6. (a) loe = el = T since len^^ = len^ = 0, and for 1 < i < m+ 1, (o(xO))™ = (o(a;l))r = -L 

{(ox)™ if i < m, 
(0)i_m if m < i. 

(b) Similarly to [Ual 

7. (e ? (x, y, 2))™''"'^ = (x)j for 1 < f < m, by definition; also, for 1 <i < max{n,p}, 

(M) ? (x, y, = (^(^0) A (o(z < y) • yvr) 

= A Uz < y) . y)H V (± A (o(y < • z)^^) 
-(o(z<y)•y)^'^ 
? (x, y, .))^™'"'^ = A (o(. < y) • y>r'^) 

v(M)tiA(o(y<]^)-z)n 
= (^T A (o(z < y) • y)^'^) V (T A (o(y < .) • z)^) 
^ (o(y < 2) • z)"'^. 

8. (a) For all 1 < i < m, 

f(x^)"^ = (x)"^ iff<[m/2j 

I (►^)r-L™/2j = (^)r-LW2j + L-/2j if L"^/2J < ^ 
(b) x-^ <1 ►x = e since lena;.^(m) — len^2^(m) = \m/2\ — \m/2\ = 0; 

1 > (x-^ > ►x) = e since (len^2^(m) — len2:.^(m)) — 1 = \m/2\ — [m/2j —1 = 0. 

The last four axioms are easy to prove if we note the following two facts. 
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(a) For any term t,t <\t = e and therefore {o{t < t) ■ t)^ = {t)f' for 1 <i < lent(m). 

(b) Within any J^-system, the identity (-^p A V (p A g) g has hnear-size proofs. 
Together, these facts show that 

( (y)f ifm = 0, 
(z)f ifm>0. 

9. (a) Note that len(3.o)^(m) = [{m + l)/2j = \m/2] and 



= if \m/2] - [m/2j = then [m/2j else [m/2j + {\m/2]+l- \m/2]) 
= \m/2l 

so both terms do have the same length. 

Now, for 1 < i < rm/2], ((xO)^)f^ = (xO)f^ = and 

{x< >>-x? {x<,x< ■ ((►x • 0) < >-x),x< ■ ((►x • 0) < 
'(x-^)f* if rm/2] = [m/2j 

^ {xM ■ ((►x • 0) < if rm/2] > [m/2j 

™ if [m/2] = [m/2j 



^ < 



^ < 



if i < [m/2j 



(((►x-0)<^x))7^ ifz>[m/2j 

if [m/2] = [m/2j 



if rm/2] > [m/2j 



{X 



{x)f ifi<[m/2j 
^ (x)f* if i = [m/2] 



if [m/2] > [m/2j 



(x) 



since (((►x • 0) < ►x))^' = {x)i\.[m/2} ■ '^^^ identical, 
(b) Note that len^(^o)("i) = \{m + l)/2] = [m/2j + 1 and 

^^^x-*[»-x?{>-x-0,l>{>-x-0),H>{*-x-0))ilTT') 

= if [m/2] ^ [m/2j = then [m/2] + 1 else ([m/2] + 1) - 1 
= lm/2\ + 1, 



so both terms do have the same length. 
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Now, for 1 < z < [m/2j + 1, 



(►(xO))r = {xO) 



j+L(m+l)/2j 



(0)i 



if i = [m/2j + 1 



and 



^ < 



(x^ > ►x ? (►x • 0, 1 > (►x • 0), 1 > (►x • 0)))^ 
(►X • 0)™ if [m/2] = [m/2j 

(1 > (►x • 0))7* if [m/2] > [m/2j 

(►x)™ if i < \m/2\ 

' " if [m/2] = [m/2 

(0)i if i > [m/2j 
(►X • 0)Yli if rW2l > lm/2 

(►x)™ if i < [m/2j 

(0)i if i > [m/2j 

(►x)^i iff<Lm/2j 

(0)i ifi>[m/2j 



if [m/2] = [m/2j 
if [m/2] > [m/2j 



(0)i if i = [m/2j + 1 

The case for xl is identical. 

(c) Similarly to [9al 

(d) Similarly to [9B 

10. ([Ax.t](x))f = {t[x/x])f = {t)f for all 1 < i < lent(m'). 

11. (a) Since len,cRN[h](e,j/)(^^) = 0, [£CRN[/i](e, y) = ef = T. 

Also, because len(^z-o)<z{p) = p + l — p=l, we have that 

(£CRN[/i](Ox,y))r'" = {h{z,y) • 0)^ [(Ox)-/(z)™+i] 

= (/i(Ox,y)-0)r" 

= ((M0x,y)-0)<M0x,y))r'" 

= (((/i(Ox,y) -0) </i(Ox,y)) • ^CRN[/i](x, y))™'" 



(similarly for Ix), 
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and for 1 < i < m + 1, 



<j<m+2—i 



m+1 — (i— l),n 



{h{z,y)-oy; 

(^CRN[/i](x,y))™f 

(((/i(0x,y) • 0) < /i(0x,y)) • ^CRN[/i](x,y))r'" 



(similarly for Ix). 
(b) Similarly to lllal 

12. For all 1 < i < lenTRN[g,/i,/i^,h,](x,2,y)("^>i^' '^)' and by the remark above, 



{x<l? igix,z,y),t,t)y; 



m,p,n 
i 



{9{x,z,y))f"' if 1 = 

(^(x<l)-^^A(o(t<t)-C"'") 

V((x < 1)™^, A (o(t < t) • t)r'^'") if m - 1 > 

{9{x,z,y))f'' ifm^l = 
'P'" if m - 1 > 



where t = h{x, z, y, TRN[5t, h, hg, hr]{x<, hi{z),y),TRN[g, h, he, hr]{>-x, hr{z),y)) . 



4.4.2 Rules of inference 

For all the rules in Definition 13.1.31 if one of the premises contains an equation of the form 
It = n]™ that degenerates to _L because lenf(mo) 7^ lenu(mi), then the rule becomes trivial. 
We therefore assume that none of the propositional translations of atomic formulas of Ti are 
degenerate cases. Also, when we use the notation "y^" with no subscript, we implicitly assume 
that the conjunction is over all relevant values of the index of the term formulas involved. 

0. Any standard, complete set of rules for the propositional calculus can be p-simulated 
within any jF-system. 

1. We have short .F-proofs of iTd]*"'". Substituting (t)^ for (x)™ throughout these proofs 
yield short /"-proofs of [^[t/x]/'". 
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2. (a) First, a few observations. Let F = {x = eyx = 0->x\/x = l- >x). Then, 



X = e 



lx = 0- >xi 



T if m = 
_L if m > 

(x)r-(o->x)r 
(^)r-(o)iA^(x)r^(>x)-i 

Ki 

(x)r^±A/\(x)r-(x)r 

Ki 



fx = 1 • >x} 



TA/\(x) 

Ki 



SO that 



{FY 



TV±V± ifm = 

±y{{x)T^±A/ki<^{x)r--{^)r) 

V((x)r - T A Ai<,(x)™ ^ (x)™) if m > 

T if m = 

[ (((x)r ^ ± V (x)r ^ T) A hi<^{x)T (^)r) if > 
T if m = 



(x)f V (x)f if m > 



Therefore, l-F]™ has hnear-size .7^-proofs. 
Now, we have short .F-proofs of 

As = lA[e]f, 

Ao = lA[x]r^ ^ [A[Ox]l"'", 
and Ai = [^[x]]™'" ^ [A[lx]r'". 

If m = 0, then by Axiom [Td] there are short proofs of 

Ix = er^(I^r'"^[^[e]f), 

which shows that there are short proofs of 

ix = er^ms]f^iAr^). 

If m > 0, then substituting 1 > x for x in Aq gives short proofs of 

|^[1 > x]r'" ^ IA[0 ■ >x]r'"; 



(4.4.1) 
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moreover, by Axiom [Tdl there are short proofs of 

ix = o- >xr ^ (M™'" ^ i^[o • >x]]™'"), 

and therefore, by transitivity, of 

ix = o- >xr ^ ([^[1 > 2;]]™'" ^ i^r-"). 

A similar argument shows that there are short proofs of 

which, together with (j4.4.ip . impUes that there are short proofs of 

Applying modus ponens to this and |-F]™ gives short proofs of 

IA[1 > x]]™'" ^ I^r-". (4.4.2) 

Repeated substitutions of 1 [> x for x in the proof of (j4.4.2p give short proofs of 

lA[m > x]!"^'" ^ [^[r^T^ > x]]™'^ 

(where we remind the reader that "/c" is a shorthand for 1 • • • 1). 

Since m \> x = e, using Axiom [id] and modus ponens gives short proofs of 

lA[e]f^lA[m>x]r'', 

and using transitivity m + 1 times now gives short proofs of 

lA[s]f^lAr'''. 

A final application of modus ponens with gives the short proofs of [^J'"'"' we 
wanted: as can easily be seen, the size of this proof is 0{m ■ p{m, n)), where p{m, n) 
was the maximum size of the proofs of A;,, Aq, Ai. 

(b) The same reasoning as for part (a) applies. 
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3. We have short proofs of [.4[e,z]p'", [A[0,z]f and 

If m = 0, using modus ponens twice on formula (j4.4.ip gives a short proof of l^]™''^'". If 
m > 0, then by repeatedly substituting first x<, h£{z) and then ►x, hr{z) for x, z in the 
proof of A' , we get a binary tree of short proofs of formulas of the form of A', where the 
formula at the root is lA[x, z]]'"'^'", the formula at each node is implied by the conjunction 
of the formulas at its children nodes, and at the leaves, the terms being substituted for u 
can all be proved to be equal to or 1 (single bits of x). 

For example, if m = 3, the tree would have the form depicted in Figure [3.4. II (where we've 
indicated only the consequent of the formula being proved at each node, so that a node 
B with children C and D represents a proof of the formula {C A D) ^ B and a node B 
with one child C represents a proof of the formula C ^ B). 



A[x, z] 




A[Q,z']AA[l,z'] A[{*'x)<,hi{hr{z))\ ^[►►a;,/i.(/i.(z))] 



A[{),z']AA[l,z'] A[^,z']AA[l,z'] 

Figure 4.4.1: Proof tree for m = 3. 

Therefore, the proofs of |A[0, z]]^'" and [^[1, z\Y''^ can be used with Rule [T] (substituting 
the right terms for z) and modus ponens to prove the formulas at the first level, and going 
up level by level using modus ponens, we obtain a proof of the consequent of the formula 
at the root of the tree, i.e., [A]™'^'". Moreover, if q{m,p,n) is the maximum proof size 
of the premises and c is a constant satisfying \h£{z)\ < 2^\z\ and |/ir(-2)| < then the 

size of this proof is 0{m ■ q{m,p ■ m^, n)) since the tree has depth no more than [log2 m] 
and thus size no more than 2m. 

Remark 4.4.1 Note that the estimates on the size of .F-proofs for Ti's theorems given above 
might be used to get more precise upper bounds than are currently known for the size of J-- 
proofs of certain families of tautologies. For example, the family of tautologies arising from the 
pigeonhole principle was first shown to have polysize .F-proofs by Buss [8j (whose estimate of 
the size of the proofs was 0{n'^^)); a careful analysis of the Ti-proof given in Chapter [3] together 
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with the results of this chapter could provide a better bound (the details would be somewhat 
tedious but straightforward), and the same could be done for the other families of tautologies 
mentionned at the end of Chapter [3l 



Chapter 5 

Ti Proves the Soundness of T 



In this chapter, we will show how to formalize a particular ^-system in Ti, how to formalize 
Buss's algorithm for the Boolean Sentence Value Problem (BSVP) [10] in Ti, and how to use 
the BSVP algorithm to prove the soundness of the given ^-system in Ti. Then, we show that 
T provably p-simulates any proof system S whose soundness can be proved in Ti. 

5.1 Formalizing ^-systems 

Because any two J-'-systems p-simulate each other, we will focus on the particular jF-system 
below: 

language: variables pi,p2, ■ ■ ■ , constants T and _L, connective brackets ( ) 
formulas: T, _L, pi (for any i > 1), and recursively, {A B) for any formulas A and B 
axiom schemes (for any formulas A, B, C): 

1. ^A^iB^A)) 

2. {(A -^{B^ O) - {{A -^B)^{A^ C))) 

3. [{{B^L)^{A^1.))^{A^B)) 

4. T 

rule (modus ponens): A,{A^B) h B 
5.1.1 Formulas 

Given a formula A of we will encode A into a string in the following way: Let wa = 
max{i : pi appears in ^4} and = 1 + [lg(tt;^ + 1)J (one more than the binary length of wa)- 
Then, for 2^ > £^^ + 2, #2^^ is obtained from A by using the following Godel-numbering scheme 
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(where ^ G {0, 1}^ ^ represents i in binary using exactly 2^ — 2 bits — note that because 
of our choice of 2^, this will always start with a 0). 



symbol 


code 


Vi 


oo(z)r-^ 


_L 




T 


001l2'-3 


( 


1002'-2 




1102'-2 


) 


0102'-2 



In fact, because we can count in Ti, it is possible to define a slightly more complicated encoding 
where the codes for "(", and ")" include information about the logical depth of the symbol 

(making sure that is adjusted to be the maximum of its old value and the logical depth of 
A). In what follows, we will use "(j") S'^d to represent the corresponding symbol 

at a logical depth of j. 

For example, the formula A = [p2 ((pio ^ -L) ^ P2)) can be rewritten as ^ = (0P2 ^0 
(1(2^10 ^2 -L)2 ^1 ^2)1)0 by including the depth information for each symbol except logical 
variables and constants, which would be encoded as follows (where the string was split in two 
to fit on the page, and a little bit of space was added between blocks of bits representing each 
symbol, for readability): 

#8^ = 10000000 00000010 11000000 10000001 10000010 00001010- ■ ■ 

■ ■ ■ 11000010 00100000 01000010 11000001 00000010 01000001 01000000. 

With this encoding, it is easy to write a function formula(a;, z) in Ti that returns 1 if 
X = #2'^^ for some formula A such that |pow^(z)| = 2^ > + 2, or otherwise. The function 
is defined by rpowCRN and simply checks that x is one of "7^2'^-'-") "7^2'^~'~", "#2'^Pi") or that 
X has the form "(„• • • )a" and that each symbol in x is preceded by a valid string of symbols, 
according to the following simple rules (using counting and masking operations): 

• and "T" must immediately follow either "(" or 

• "(j" must immediately follow either "(j-i" or 

• must either immediately follow "(jPi'S "(j-L") or "(jT", or it must follow, in order, 
"(j(j+i- ■ ■ )j+i" (where everything between the parentheses has depth at least j + 1); 

• ")j" must either immediately follow ^^^jPi\ "^j_L", or "^jT", or it must follow, in 
order, "^^(j+i- • • )j_|_i" (where everything between the parentheses has depth at least 

i + 1)- 
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5.1.2 Proofs 

Now, we can encode .F-proofs easily. A proof Ai, . . . , Ai^ is encoded by a pair of strings: 
(#2£^i, . . . , #2«^fc)fc and (ji, . . . , jfe)fc, where 2^ > maxi<i<fc{^A,} + 2 and 

(0,m) if is an instance of axiom m, 
{ki,k2) if Ak2 = Aki ^ Ai. 

Again, it is straightforward to write a function proof (x, y, z, w) in Ti that returns 1 if x, y encode 
an .7^-proof for 2^ = |pow^(2;)[ and k = \w\, or otherwise, by using simple masking operations 
and rpowCRN. (Technically speaking, the encodings of the formulas in a proof must be padded 
so they all have the same length, but it is a simple matter to take care of.) 
Finally, we can define in Ti the following function: 

)7r!^!(x) if proof (x,y,2;,'u;) = 1, 

F[x,y,z,w) = < 

t#|powi(^)|T otherwise, 

that returns the tautology proved by x, y (or some fixed tautology if x, y is not an ^-proof). 



5.2 Buss's algorithm for the BSVP 

For reference purposes, we will now summarize Buss's most recent published ALOGTIME 
algorithm for the BSVP [10]. Actually, we present a slight variation of his algorithm applied to 
formulas containing only the connective as opposed to "A" and "V" . Given such a Boolean 
sentence, we can represent it as a binary tree with 2'^^^ — 1 leaves for some d (we can pad any 
sentence so that it meets this condition by preceding it with enough copies of "T — s- ■ ■ ■ " ) , where 
each leaf stores either "T" or "±" and each interior node represents the connective "— For 
two nodes U and V in this tree, we write "C/ > V" to mean that U is an ancestor of V, and 
"[/ ^ V" to mean U = V or U > V. The least common ancestor of U and V is denoted 
lca([/, V). By convention, we draw trees with the root at the top and the leaves at the bottom, 
so that "above" and "below" correspond to "ancestor" and "descendant" , respectively. Also, we 
define a scarred sentence as a binary tree whose leaves store T or ± and that contains exactly 
one internal node with only one child (the missing child is called the scar). The "value" of a 
scarred sentence is defined to be a pair of truth- values {tT,t±), where tj is the value of the 
Boolean sentence obtained when the scar is replaced by T, and similarly for t±. 

The algorithm will be described as a pebbling game on the formula's tree between two 
players: the Pehhler and the Challenger^ and it proceeds in rounds. During each round, the 
Pebbler places pebbles labelled with a truth-value "0" or "1" on nodes of the tree, representing 
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assertions by the Pebbler that the subformulas rooted at those nodes have the indicated truth- 
values. Following the Pebbler's move, the Challenger challenges one of the pebbled positions 
[/, representing an assertion by the Challenger that the pebble value at \J is incorrect (and 
implicitly, that every pebbled position below \J is correct). 

Intuitively, the essential feature of the pebbling-challenging game is to break up the work 
by creating scarred subsentences and evaluating them at the same time as their scar, instead 
of performing the evaluation sequentially. (For example, we could evaluate (A — > i?) — C by 
evaluating (A — > ^ C in parallel with i3, where "V^" indicates the position of the scar.) 
Together with Buss's innovative technique for finding scar positions in a semi-oblivious fashion 
through distinguished leaves, this feature of the algorithm allows even unbalanced sentences to 
be evaluated in a logarithmic number of steps. 

The game is designed so that the Pebbler has a winning strategy if the value of the sentence 
is "1"; otherwise, the Challenger has a winning strategy. Many of the rules of the game 
might seem somewhat arbitrary and strict, but they are designed so that a play of the game 
can be evaluated in ALOGTIME while preserving the property that the correct player has a 
winning strategy. For example, the game will never last more than d rounds (when there are 
2^^+! — 1 leaves), and since specifying arbitrary pebble positions would require 0{d) bits (which 
would take us outside ALOGTIME), there must be a "semi-oblivious" way of specifying pebble 
positions using only a constant number of bits per round. 

Before giving the details of the pebbling game, we need to introduce a bit more notation. 
First, leaves will be numbered from left to right, starting with 1, and assigned a rank equal to 
the largest integer k such that 2'^ divides the leaf number. Next, in each round i > 1, there 
will be distinguished leaves Li, Ci, and Ri (for "left", "center", and "right", respectively) and 
distinguished nodes Ai and Bi (for "above" and "below", respectively), satisfying the following 
conditions (see Figure [5.2.11 for a picture). 

1- Ai\>Bi\> d, with Ai = Bi only if Bi = d; 

2. Ai is the lowest (and latest) challenged node, while Bi is the highest pebbled position 
satisfying the first condition — or Bi = Ci if there is no pebbled node below Ai (informally, 
the players have "agreed" at Bi but "disagree" at Ai); 

3. Li and Ri are distinct leaves of rank d — i and Ci is of rank greater than d — i; 

4. every leaf in the subtree rooted at Ai but outside the subtree rooted at Bi has number in 
the range 

{Li - 2'^-\Li + 2^^-') U (i?, - 2'^-\ Ri + 2<^-') 
(where the intervals are open). 
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The pebbling game proceeds as follows. In round 0, the Pebbler must pebble the root node 
with value "1", and the Challenger must challenge the pebble at the root. In preparation for 
round 1, set Ai to be the root, Bi = Ci to be the leaf numbered 2*^, and Li and Ri to be the 
leaves numbered 2^*"^ and 2'^ + 2'^~^, respectively (see Figure [5.2.21 for an example where each 
leaf's number and rank is indicated). In round i > 1, let Ui = lca{Li,Ci) and Vi = lca{Ci, Ri) 
(note that Ui and Vi are distinct because Li and Ri are distinct). 




number: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 
rank: 0102010301020104010201030102010 

Figure 5.2.2: Labelled example of the BSVP algorithm at round 1. 
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In each round, the Pebbler uses six bits of information (one per node) to pebble Ui, Vi, and 
their two immediate children {Ul, Vj^ to the left and Uf, to the right, respectively). In 
addition, the Pebbler must use three bits of information to specify the relative positions of Aj, 
C/j, and Vi (i.e., which nodes are ancestors of which ones — since all three nodes are ancestors 
of Cj, they all lie on the path from Ci to the root). 

The Challenger then challenges one node from among Aj, U} ^ Uf, Vi, V^ , or V^ (using 
three bits to specify which one), subject to the conditions that the node challenged must be in 
the subtree rooted at Ai and outside the subtree rooted at Bi. 

For round i + 1, Aj+i is set to the node just challenged, -B^+i is set to the highest pebbled 
node below ^i+i (or to Cj+i if there is no pebbled node below ^i+i), and -Lj+i, Cj+i, Ri+i are 
set according to Table 15.2. 1[ 



Challenged 
Node 


Pebbler says 
Ui > Vi 


Pebbler says 
Vi > U, 


Ul 


Cj+i = Ui 

Ri+1 = Li + 2^^^* ^ 

T T r)d—i~l 

^i+1 — i-ii — A 


Ci+i = Li 

Ri+1 = Li + 2'^~^~^ 

T T nd—i—l 
^i+1 — ^i — Z 




Ci+1 = Ci 

-ftj+i — Hi -\- z 
Li+i = Li + 2'^ * ^ 


Ci+1 = Ci 

— tii — Z 

Li+1 = -Lj + 2*^ * ^ 


vl 


Ci+i = Ci 

n rj nd—i — l 

tM+l — Ui — Z 

Li+i = Li + 2'^ * ^ 


Ci+1 = Ci 

T> P rjd—i—1 

— tii — Z 

T T r,d—i—l 
J^i+1 — J^i — Z 




Ci+i = Ri 

Ri+1 = Ri + 2^^"*"^ 

^i+1 — rCi — Z 


Ci+1 = Ri 

Ri+1 = Ri + 2*^"*"^ 

r p nd—i—1 

J^i+1 — Ui — Z 


Ui or Vi 


Game Ends 


Game Ends 



Challenged Node: Ai 


Pebbler says 
Ai > Ui,Vi 


Pebbler says 
Ui,V, \> A, 


Pebbler says 
Ui>Ai> Vi 


Pebbler says 
Vi\>Ai> Ui 


Ci+1 = Ci 

p P _L nd—i—1 

Ui+i — Hi + Z 

T T r,d—i—l 
^i+1 — l^i — Z 


Ci+1 = Ci 

p p nd— t — 1 

-Kj+l — Hi — Z 

Li+1 = Li + 2'^ * 


Ci+1 = Ci 

Ri+1 = Ri + 2'^~'^^^ 

Li+1 = Li + 2'^ * ^ 


Ci+1 = Ci 

n p nd—i — l 

T T nd—i—1 
ti+1 — Lii — Z 



Table 5.2.1: Next leaf nodes in Buss's BSVP algorithm. 
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Now, it is easy to show by induction on the number of rounds played that properties 1~4 
are preserved for the duration of the algorithm: it is simply a matter of checking case-by-case 
each possibility in Table [572.11 for the values of Cj, Lj, and Ri. For example, suppose that IS 
challenged in Figure then for round 2, A2 = V^, B2 = C2 = Ri = 24, L2 = Ri - 4 = 20, 

and R2 = Ri + 4: = 28 (we refer to leaves by their number), so A2 > B2 = C2, A2 is the lowest 
challenged node and B2 = C2, L2 and R2 have rank 2 = 4 — 2 and C2 has rank 3 > 2, and 
every leaf in the subtree rooted at A2 but outside the subtree rooted at B2 has number in the 
range (20 - 2^-^, 20 + 2^-2) u (28 - 2^"^ 28 + 2^-^) ^ (^g^ 24) U (24, 32). 

The game ends as soon as one of the players makes an "obvious" mistake, i.e., one from 
the following list. (Note that by property 4, the game must end by round number d because 
[Ld - 2'^-'^, Ld + 2'^-'^) U {Rd - 2'^-'^, Rd + 2'^~'^) = {Ld, Rd}] it is easy to see that in that case, 
one of the two players will be forced to make a mistake from the list below.) 

• Pebbler: when the input nodes of a gate are either leaves or are pebbled and the output 
node is pebbled incompatibly. 

• Challenger: when the output of a gate whose input nodes are either leaves or pebbled is 
challenged, even though it is correctly pebbled. 

• Pebbler: when a leaf is incorrectly pebbled. 

• Challenger: when a correctly pebbled leaf is challenged. 

• Pebbler: when a node is pebbled with both "0" and "1" . 

• Pebbler: when an incorrect assertion is made about whether Ui [> Vi, Ai \> Ui, Ai \> Vi. 

• Challenger: when the challenged node is above a previously challenged node. 

• Challenger: when the challenged node is at or below a previously agreed upon pebble 
value (a pebble is "agreed upon" if it was placed in an earlier round and in that round, 
the Challenger challenged an ancestor of that pebble). 

It is straightforward to see that the game produces the correct result: if the value of the 
sentence is "true" , the Pebbler can win the game by simply pebbling every node with its correct 
value and making assertions compatible with the structure of the sentence, while if the value 
of the sentence is "false", the Challenger can win the game by always challenging the lowest 
incorrectly pebbled node that is not below a previously agreed upon node. 

Moreover, the game can be translated into an ALOGTIME algorithm, as follows: First, 
simulate possible plays of the game using existential moves for the Pebbler and universal moves 
for the Challenger. Then, for each such game, existentially guess the first mistake made and 
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universally verify that no earlier mistake was made. Note that from a play of the game, it is easy 
to determine the last round when Lj was computed from Rj and to compute the appropriate 
sum of powers of 2 to add to in order to get Lj (the same goes for As for Cj, simply 
find the last round when Cj was equal to Lj or Rj and we know that = Cj . Finally, because 
it is possible to count in ALOGTIME, ancestors can readily be computed to find Ui, Vi and 
thus determine Ai and Bi. 

5.3 Formalizing the BSVP 

The algorithm that we will use to solve the BSVP in Ti is simply a formalization inside our 
theory of the algorithm described in the previous section. To formalize this algorithm inside 
Ti, we will define a function BSVP that decomposes and evaluates the sentence, using TRN 
to perform the work in parallel, disjunction (OR) over all possible Pebbler guesses to emulate 
existential moves, and composition and implication to emulate Challenger's universal moves. 

First, it is easy to define by rpowCRN a function sentence(f , x, z) that takes as argument a 
truth- value assignment v (represented simply by a string whose 1^* bit is the value of pi , whose 
2"*^ bit is the value of p2, etc.) and the encoding of a formula x = #|powi'(z)|^) ^"^^ returns the 
sentence obtained by substituting the given truth-values for the variables in x. 

Next, we define the function BSVP(d, h, m, s) that does the work according to Buss's al- 
gorithm. There will be one variation: because we want the function to apply to arbitrary 
sentences, but a sentence must have a power of 2 minus 1 leaves in the algorithm, we will pad 
sentences so they have 2*^+^ — 1 leaves and remember the position of the root of the original 
sentence inside the padded version as "M". (The algorithm needs to be changed so that it 
takes the distinguished node M into account at the same time as A, B, L, C, R, but the 
changes are easy to make since M remains fixed for the duration of the algorithm.) In what 
follows, the parameter s is fixed and encodes a superformula of the Boolean sentence we are 
evaluating (padded so it always has a power of two minus one leaves) , the parameter m is fixed 
and indicates the root M of the subformula of s whose value we are interested in, the parame- 
ter h varies and represents the history of the game so far, as a sequence of blocks biai ■ ■ ■ biUi 
(each block a constant-length string encoding Pebbler's guesses on the relative positions of the 
nodes A, U, V, B, and M (in bj), as well as Challenger's chosen node (in a^)), and d varies and 
represents 2 to the power of the current round number, in unary. The function BSVP((i, h, m, s) 
returns a truth-triplet {c,tT,t±), where c is a check-bit indicating whether h is s, valid descrip- 
tion of the structure of s or not, and {tj, t±) is the value of the possibly scarred subformula of 
M picked out by h. The function will be defined by TRN on d, following Buss's algorithm, but 
first, we must specify how truth-triplets can be combined in various ways. 
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We generalize disjunction and implication to truth-triplets, and define composition of truth- 
triplets, as follows: 

ic^,t\,t\_) W (c^^T,^i) = (c^ Vc^(c^ At\) V {c^ At\),{c^ At]_) V (c^ At^)), 
(c\iV,ii)^(c',tT,ii) = (c^ A 0^,4^4, ti^ti), 
{c\t\,ti)o{c^,t\,tl) = (c^ Ac2,t,\,ti^) 

(where t^2 is equal to tj if i j = T and if = ±, and similarly for t^2 )• 
Then, for h = 6iOi • • • biUi, we can define BSVP(d, h, m, s) as follows: 

composition of BSVP(^d, /i6a, m, s)'s using o and — i^, 
based on structure induced by b and where a picks out • 
.different subformulas at the current round 
Because b has a fixed length, the disjunction actually represents a fixed number of cases, each one 
of which has a unique structure determined by the value of b. We will not list all possible cases 
here (they can easily be written down from the description of Buss's algorithm and Table [572. ip . 
but we give two illustrative examples based on the sentence depicted in Figure 15.2.21 

1. Consider the sentence depicted in Figure [5.3. H where we have filled- in the unique interior 
node that represents "m" and each leaf that falls inside the correct intervals around 
Li and At round 1, we have that BS\/P{d,h,m, s) = BSVP(^(i, /iba^/i , m, s) 
BSVP(^(i, hbajj2 ,m,s), where "ajyi" and "a[/2" are fixed-length strings representing which 
subsentence is selected, and "6" is the unique fixed-length string representing the correct 
structure of the formula. The other parts of the formula (under V^) are of no interest 
because they fall outside "m" . 




rank: 0102010301020104010201030102010 
Figure 5.3.1: Labelled example of the BSVP algorithm at round 1. 

2. If we look at the first recursive call of BSVP in the preceding case, we have the situ- 
ation depicted in Figure I5.3.2[ in which case it is easy to see that BSVP((i, h, m, s) = 



BSyP{d,h,m,s) = y 

all Pebbler 
euesses b 
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BSVP(^(i, hhaA, m, s) o (^BSVP(^d, hhajji ,m,s) ^ 
(bsvp( ►d, hhajj2 ,m,s) o (BSVP(^(i, hbayi . 



m, s 



BSVP(^(i, hbav2 , m, s) 




rank: 0102010301020104010201030102010 
Figure 5.3.2: Labelled example of the BSVP algorithm at round 2. 



At the last round (when d = 1), the history h is analyzed and compared with the actual 
structure of the sentence, and the check-bit c returned is T iff they agree. The check-bit can be 
obtained by taking the OR of a bit-string computed by rpowCRN on h, where the bit output 
for each block 6 in /i is 1 iff 6 is correct. Moreover, each of these bits can be computed by 
finding the positions of the L, C, and R leaves from the first part of the history h (which is 
easy to do by TRN) and then finding least common ancestors of these leaves (which again can 
be done easily by TRN). 

At the same time, the actual sentence left at the last round will have one of the simple 
forms shown in Figure 15.3.31 which can all be evaluated trivially since we know the values of 
L, C, and R, and we can easily check when B = C. 

A A A 






L 



B 




A 

L 



A 

C 



A 
R 



Figure 5.3.3: Base cases for BSVP. 



There remains one technical detail that needs to be taken care of in the definition of BSVP: 
as given, the definition is not a proper application of TRN since the value of BSVP((i, /i, m, s) is 
recursively defined in terms of BSVP(^(i, /i', m, s) for more than one value of h' . The situation 
can easily be remedied, in the following way. First, the bounded over all values of b can be 
implemented with a subtree whose depth is equal to the number of bits in b (which is a constant) , 
where a single bit is added to b at every level (to get all possible values of b at the bottom) and 
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W is used to combine the values at each node in the subtree. Next, the composition of values 
based on the structure determined by b can similarly be carried out step-by-step using a subtree 
of constant depth, where at each step, the value of b is used to determine which function (o or 
— w^) to apply to combine the results, and which bits to add to a to select subformulas. Both of 
these steps only require increasing the length of d by a constant factor, and using powmod^ to 
determine which level is being evaluated in the subtrees. 

Finally, we can define a function VALUE(t>, x, z) that computes the truth- value of the formula 
encoded by (x, z) under v: 

VALUE(z;,x,z) = AND(BSVP(2'^ x leaves(x, z), e, mask(x, 2;), sentence(f , pad(x, z), z))), 

where pad(x, z) pads the formula encoded by x and z, adding enough copies of "T ^ ■ ■ ■ " to 
the left so that it has 2'^"'"^ — 1 leaves for some integer (5 > 0, mask(x,z), returns the position 
of the root of the formula encoded by x inside pad(x, z), using a bitmask (where the root is 
indicated by its main connective "^"), leaves(x, z) returns a string of length 2^ (for the same 5 
as above), and k is the fixed number of bits in one block "6a" of the history. All these functions 
are easily defined in Ti using rpowCRN and TRN, as follows. 

First, note that a formula with n leaves always contains exactly An — ^ = n + 3(n — 1) 
symbols (1 for each leaf, and 3 for each connective: two parentheses and one connective), each 
one encoded by a block of length |pow^(2;)|. This means that the function 



returns the number of leaves of the formula encoded by x, in unary. Also, we can check whether 
a formula has a power of 2 minus 1 leaves or not with the function ispow^. Hence, if we define 
leaves(x,2;) = sevael(x • (7 x pow^(z)),z), where 



then the string x • (7 x pow^(z)) contains 4n — 3 + 7 = 4(n + 1) blocks of bits of length pow^(2;) 
so that leaves(x, z) contains exactly 2^ bits for any formula x that contains between 2^ and 
2'^"'""^ — 1 leaves, inclusive. 

Next, we define by TRN a function padding(y, if, z) that returns a balanced sentence con- 
taining exactly \y\ leaves, each one having the value T, where the logical depth of each symbol 
is at least \w\ and the length of each symbol's encoding is |pow^(2;)|: 



numleaves(x, z) = powdiv^((x • (3 x \30\n^{z)))«,z) 




if X <^ 8 X pow'(2;), 
) • sevael(^x, z) otherwise. 



padding(y,i(;, 2; 



■) = { 



\ 



y (£,#|po^L(^)|T) ify = £,0,l, 
#|powi'(2)|(|«;rPadding(y-^,u;l,2;) • #, 
• padding(^y,u;l,2;) • # 

|pow^(2)|)|io| otherwise. 
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Now, we can easily define 

pacl(x, z) = X (if X has a power of 2 minus 1 leaves), 
pacl(x,2;) = #|po,„L(2)|(o-padcling(numleaves(x, l> leaves(x, z), 1, z) 

■ #|powt(2)| ^0 •deepen(x, z) ■ #|powt(2)|)o (otherwise), 

where deepen (x, z) is easily defined by rpowCRN to add 1 to the logical depth of every symbol 
in the formula x. Finally, we can define mask by rpowCRN: 

mask(a;, z) = the unique connective — > at logical depth inside pad(x,2:) 

(if X has a power of 2 minus 1 leaves), 
mask(x,z) = the second connective at logical depth 1 inside pad(x,2;) (otherwise). 

5.3.1 Proof of correctness in Ti 

Theorem 5.3.1 T\ proves VALUE(v, #|po^L(2)|T, z) = 1, VALUE(t;, #|po^L(2)[_L, z) = 0, and 
for arbitrary formulas M and N, 

VMUEiv, #|powi(.)|(M ^ N), z) = VALUE(r;, #|pow^(,)|M, z) VALUE(i;, #|powi.(.)|A^, z) 

(i.e., VALUE is intensional) . 

Proof The first two statements follow directly from the definition of the functions involved. 
The third statement follows from Claim [F. 3. II below. □ 

Claim 5.3.1 

1. If h picks out a supersentence of (M N), possibly scarred at B, in the sentence encoded 
by s, then BSVP(ci, h, lca(m, n), s) = BSVP((i, h, m, s) — BSVP((i, h, n, s) (where lca(m, n) 
is a mask indicating the position of the least common ancestor of the nodes masked by 
m and n in the sentence s). 

2. BSVP((i, e, m, s) = BS\/P{d' ,£,m' , s') for all values of the parameters that represent the 
same sentence, i.e., given x = i^iA, for any s, s' that are supersentences of x, where m, m' 
and d, d' are defined appropriately. 

Proof 

1. By induction on d. When d = 1, only the first four cases of Figure [5.3.31 apply. Suppose 
we are in case 4; then, A = [M — > A^) and BSVP(d, /i, lca(m, n), s) = (c, r, T) (where r 
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is the value of node R), BS\/P{d,h,m,s) = (c, T,_L) (since M = B is the scar), and 
BSVP(d, /i,n, s) = (c, r, r), so the statement is true. The other three cases are similar. 

Now, if h picks out a supersentence of {M ^N), possibly scarred at B, then consider the 
following cases. 

(a) If BSVP(d, /i, lca(m, n), s) = BS\/P{>-d,hba,\ca{m,n), s), then hba also picks out a 
supersentence of (M —>■ N), possibly scarred at B, and the statement is true by the 
induction hypothesis. 

(b) liU>V = {M^ AT), then 

BSVP(d,/i, lca(m,n),s) 

= BSVP(^(i, hbayi, lca(m, n), s) BSVP(^(i, hbay^, lca(m, n), s). 

Also, 

BS\/P{d,h,m,s) = BSVP(^d, /i6ayi, m, s) = BSVP(^d, /i^ayi, lca(m, n), s) 

and 

BSVP(c?, h, n, s) = BSVP(^d, hbav2,n, s) = BSVP(^d, hbay^, lca(m, n), s) 

by the induction hypothesis. Hence, the statement is true. (The case when V > 
U = {M ^N) is similar.) 

(c) lt{M^N)=U>V, then 

BSVP(d, /t, lea (m,n), s) = BS\/P{>-d,hbaiji,\ca{m,n),s)^ 
(BSVP(^d, hbaij2,\ca{m, n), s) o BSVP(^d, hbay, lca(m, n), s)). 

Also, 

BSVP(d, h, m, s) = BSVP(^d, hbaui , m, s) 

and 

BSyP{d,h,n,s) = BSVP(^d, /i6a[/2 , n, s) o BSVP(^d, /t6ay, n, s), 

where 

BSVP(^d, hbav, lca(m, n), s) = BSVP(^d, hbay, n, s) 

= BSVP(^(i, hbavi,n,s) BSVP(^d, /i6ay2, n, s). 

Since t (ti o ^2) = {t — ^ ^1)0^2 for any truth-triplets t, ti, and t2 such that t 
is unscarred (i.e., = t±), the statement follows immediately by the induction 
hypothesis. (The case when (M N) = V > U is similar.) 
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(d) If [/ > (M — > A^) > V ^ then assuming M > y (the other cases being similar), we 
have that 

BSVP(d,/i,lca(m,n),s) 

= BSVP(^d, /i6a[/2, lca(m, n), s) o BSVP(^(i, hhay , lca(m, n), s). 

Also, 

BSVP((i, /i, m, s) = BSVP(^(i, /i6a[/2,m,s) o BSVP(^d, /i6ay , m, s) 

and 

BSVP(d, /i, n, s) = BSVP(^d, hbau2 ,n,s), 

where 

BSVP(^(i, hbav, lca(m, n), s) = BSVP(^(i, hbav,m, s) 

= BSVP(^(i, hbayi,m,s) BSVP(^d, hbay2,m,s). 

Since (ti o t = (ii — »^ t) o t2 for any truth-triplets t, ti, and ^2 such that t 

is unscarred, the statement follows immediately by the induction hypothesis. (The 
case when U > (M N) > y is similar.) 

(e) The case when (M N) > U,V is similar to the last one, and all cases can be 
suitably simplified when B > U oi B > V. 

2. We will actually prove that for all supersentences s of x, given a mask m for j; in s and 
a suitable value for d, BSVP(d, e, m, s) = BS\/P{dx,e,mx, Sx), where Sx, rux, and dx are 
the default values for x. This will be proved by induction on the number of leaves of 
the sentence encoded by x. If x encodes a sentence with only one leaf, then x either 
encodes "T" or "_L" , in which case there is a unique history h that picks out x from any 
given sentence s containing x. For that history, it is easy to see that BSVP(d, e, m, s) = 
BSVP(1, /i, m, s) which is equal to the value of x, and the same is true for the default 
values of s, m, and d. 

Now, suppose that x encodes a sentence M' = (M — > A^) with k > 1 leaves, and let s be 
any supersentence of x. Then, 

BSVP((i, e, m, s) = BSVP((i, e, m, s) ^ BSVP(d, e, n, s) 

= BSyP {dx,e, nix, Sx) ^ BS\/P{dx,e,nx,Sx) 
= BS\/P{dx,e,m'x,Sx) 

where the first and last equality are true by Claim 15.3. It fT]) , and the middle equality is 
true by the induction hypothesis. □ 
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5.4 The soundness proof 

5.4.1 Preliminciries 

If we let TR[)E{v,x,z) = formula(a;, z) A'^\/AL[)E{v,x, z), then we can express the fact that our 
.?^-system is sound with the following statement in Ti: TR[iE{v, F{x,y, z,w), z) = 1. 

We will show that Ti can prove this statement, by induction on the parameter w (which 
indicates the number of lines in the proof encoded by x,y). For this, though, we will have to 
define a form of "strong induction" in Ti. First, we need to define a notion of prefix for strings: 
"y E , defined below, returns 1 when y is a prefix of x, otherwise. 

y C X = y =® \c{x,y). 

Next, we will formalize the notion of "part-of" quantifiers in Ti. More precisely, we will show 
how to represent the part-of quantifications ^yi^^ A[y] and VyCr ^iu] for any fixed formula 
A. Since we have shown that is equivalent to = in Ti and that the connectives of Ti are 
equivalent to their functional counterparts, we can replace = with =®, with -i^, etc. inside 
A[y] to obtain a term A{y) for which we know Ti can prove A[y] A(y) = 1, and this for each 
value of y. Then, if we define 

cat A{yi) = cat A{y) ■ A{yi) 
by CRN, it is immediately clear that 

/\A[y]^AND(catA(x)) = l, 

y\Zx 

y A[y] ^ OR(catA(x)) = 1. 

Now, suppose that for a particular formula A, we can prove in Ti that A[e] and ^y^^x ^[v] ~^ 
A[xi]. Can we conclude that A is true? An easy NIND on x proves ^y^^x ^[v] ^i- 

1. AyCe^iy] — ^[^]' which we can prove by assumption; 

2. assuming that Aj/Cx^iy]' application of modus ponens gives us A[xi], which implies 
that Aj/Cxi^M Isolds by properties of AND and the definition of cat^i. 

Hence, Ti proves AyCx^ty]' which implies, in particular, that A[x] holds. 

5.4.2 The proof 

Theorem 5.4.1 Ti proves JR\JE{v, F{x,y,z,w),z) = 1. 
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We use the "strong induction" described above to prove the theorem. The proof itself will be 
quite short. 

First, F(x,y,z,e) = #|powi-(^)|T, and TRUE(t;, #|po^L(^)|T, z) is obviously equal to 1 by 
Theorem 15.3.11 From now on, we will implicitly assume that F{x^ y, z, vS) is not equal to 

Next, assume that y^^,-^ TRUE(z;, y, z, li), z) = 1, and consider the following cases, 
based on the value of y\w\\-, for the value of TRUE(t', ^(x, y, z, tt;l), z). 

• If y|u,i| = (0, 1), then y, 2, wl) is the encoding of a formula of the form (5— > A)), 
in which case Theorem 15.3.11 implies that 

TRUE(t>,F(x,y,z,ti;l),z) = 

TRUE(t;, #|powi(.)|^, z) ^« (TRUE(7;, #|powi.(.)|^, z) TRUE(^;, #|powi.(.)|^, ^)) , 

which is obviously equal to 1 in Ti, being a simple tautology. 

• The other three axioms can easily be dealt with similarly. 

• If y\w\\ = (^1,^2)5 then we know that F(x, y, z, ^2) = 7r['^^'(x) encodes a formula A^^ = 

— > A\^i\, where A}.^ is the formula encoded by F{x,y, z,ki) = 7r['^^'(x) and A^^i^ is 
the formula encoded by F{x,y, z,wl). But then, by the induction hypothesis, we know 
that TRUE(t), y, z, /ci), z) = 1 and 

TRUE(7;, F{x, y, z, £2), z) = TRUE(7;, F{x, y, z, ki),z) -^'^ TRUE(u, F{x, y, z, wl),z) = 1, 
so it immediately follows that TR\JE{v, F{x,y, z,wl), z) = 1. 

5.5 Simulation results 

In this section, we show that J- can p-simulate any proof system S whose soundness can be 
proved in Ti. A similar result was first proved for PV by Cook [18J (Krajicek gives a more 
detailed proof in his book [231 Theorem 9.3.17]), but to our knowledge, this is the first theory 
of ALOGTIME reasoning for which such a result is shown. 

Intuitively, the proof hinges on the fact that for any formula j4, the propositional translations 
of the Ti equation "TRUE(f , 7^2*^) l) = 1" can be proven equivalent to a substitution instance 
of A itself, so that if Ti proves the equation, then ^ is a tautology with short .F-proofs. 

More precisely, recall from Chapter 2] that for any equation t = n of Ti , we defined a family 
of propositional translations |t = -u]™'" that have polysize J'-"-proofs whenever t = n is a the- 
orem of T\. Hence, for any formula A and string L such that |l| = 2^ > + 2, if Ti proves 
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TRUE(i;, #2*^) l) = 1, then there are polysize .F-proofs of the corresponding propositional 
tautologies |TRUE(?;, #2<^) l) = 1]^ (where k is the length of the free variable v), and these 
tautologies are defined as (TRUE(t', 7^2*^) l))^' <-> T, which is equivalent to (TRUE(t;, #2^^) 
(the term formula describing the first and only bit of the term TRUE(t;, #2*^; l) as a func- 
tion of the bits of its free variable v). Also, since Ti can prove TRUE(w, 7^2'^(^ ~^ = 
TRUE(u, #2*^^) l) — >^ TRUE(t;, #2*^-^) l) for any formulas A and B, there are polysize .F-proofs 
that the corresponding propositional translations are equivalent, i.e., 

(TRUE(t;, #2.(A ^ B),l))\ ^ ((TRUE(^;, #2.^, l))^ ^ (TRUE(t;, #^,B, , 

so that if Ti proves TRUE(t;, #2* {-^ ^ = 1, then there are polysize ^-proofs of 

(TRUE(t;, #2. a l))? - (TRUE(^;, #^,B, l))^. 

Applying this reasoning recursively shows that if Ti proves TRUE(u, #2*^^) l) = 1, then there 
are polysize ^-proofs (call them Ilfc) of A[(TRUE(w, 7^2*^1 (i.e., the formula A where 

each propositional variable pi has been replaced by the formula (TRUE(u, #2^Pi) l))^). Since 
the subformulas (TRUE(t>, #2'^P«) are never "broken up" inside 11^, we can just substitute 
Pi for (TRUE(?j, #2'^Pi) throughout to get polysize .F-proofs of vl. 

Now, since J- can "evaluate" sentences (i.e., given a propositional formula with no variables, 
J- has polysize proofs that it is equivalent to its truth- value) , for any function /(xi, . . . ^Xn) 
definable in Ti, and any tuple of strings ai € {0, 1}*, ■ ■ ■ ,an € {0, 1}*, there are polysize T- 
proofs that {f{x))'^[{aj)i^/{xj)'^'] <-5- (/(a))j (where {aj)i is equal to T or ± depending on the 
value of bit number i of aj). In particular, if S" is a proof system formalizable in Ti as a function 
symbol S{x, z), then for any particular S'-proof (a, b) of a formula A, there are polysize ^-proofs 
that S{a, b) is equivalent to the encoding of A. 

Putting these two facts together, we have that if S{x, z) is a proof system whose soundness 
can be proved in Ti (i.e., for which Ti can prove TRUE(f , S{x, z), z) = 1), then for any particular 
S-proof (a, b), there are polysize .F-proofs of the formula encoded by S{a, b), i.e., JT p-simulates 
S. Moreover, it appears that the translation from S'-proofs to J^-proofs can be carried out in 
NC^ and thus formalized in Ti, where the simulation proof can also be formalized (although 
we do not carry this out, we do not expect any technical difficulties to arise in the details of 
such a formalization). 

Remark 5.5.1 Note that Theorems 14.4.11 and I5.4.T] immediatelv give an alternative proof 
that .F-systems have polysize proofs of their own partial consistency (when suitably expressed), 
a fact first proved directly by Buss [9]. The partial self-consistency statements obtained through 
Ti would be different from the ones Buss considered, but it should be possible to prove that 
they are equivalent. 
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Related Work 

In this chapter, we show that Aral's AID is equivalent to QTi (a suitably defined quantified 
version of Ti), and briefly discuss the relationship between Ti and Clote's ALV or ALV' . We 
will keep the discussion at a high level, with few technical details. 

6.1 Ti and AID 

If we define QTi to be a first-order theory whose non-logical symbols are those of Ti and 
whose axioms are the universal closures of the axioms of Ti, together with axiom schemes 
corresponding to NIND and TIND, then we can show that 

• QTi is a conservative extension of Ti (the proof is similar to Cook's proof in [17] that 
QALV is conservative over ALV'); 

• for every Sg-formula B in QTi, there exists a function symbol B in Ti such that QTi 
proves B[x]<^B{x) = 1 (sharply bounded quantifiers, e.g.^Vx < \t\" , are easy to represent 
functionally since we already have "part-of" quantifiers from the end of Chapter [5] and 

< |t|-B[x] Azct example); 

• QTi proves the scheme of Sg-LIND (with a straightforward application of NIND). 

Next, the primitive functions of AID are all easily defined in Ti (all treating their inputs 
"numerically", i.e., ignoring leading zeroes), and their defining axioms can be proven without 
difficulty. Also, for every inductively defined predicate A^'^'^'^ in AID, we can define a {0, 1}- 
valued function A in Ti such that the equation A{x,p) = 1 provably satisfies the defining axioms 
(A.0)-(A.2) of A. This can be done by TRN in a relatively straightforward manner, except for 
two technicalities that we discuss now. 

First, the recursive definition of A in AID involves computing the values of predicates 
Di{x,p), . . . , Dm{x,p) at every level of the recursion, even though these computations can only 
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be represented in Ti by function symbols of rank 1 {i.e., the computations are in NC ). Since 
functions of rank 1 are not allowed in the recursive part of a definition by TRN, we need to 
precompute the values of the D predicates for every level and extract the correct values during 
the recursive definition by TRN. This is accomplished by first computing the concatenation of 
all the values of p for which the D predicates need to be computed, then using CRN to compute 
the concatenation of the D predicates for each value, and finally breaking up this string in the 
appropriate way during the recursion (so the order in which the values are listed is important 
and must be chosen carefully). 

Second, the depth of the recursion is controlled by the "linear form" i\\x\\, which is repre- 
sented in Ti by a numerical function, i.e., one whose actual string length could be arbitrarily 
longer than its numerical length because of leading O's. By simply extending the precompu- 
tation carried out above for the D predicates so that the values of the B predicate are also 
computed for each value of p at the "bottom" of the recursion, we can easily define the term 
A by TRN so that the recursion terminates early once the base cases are reached, so that the 
exact string length of the parameter controlling the TRN does not matter, as long as it is long 
enough. 

In the other direction, every primitive function of Ti can easily be defined in AID, using 
Aral's representation of strings (a string h^-i ■ ■ - bo is represented by the integer Ibk-i ■ ■ - bo), 
since the primitive "part" function of AID can be used directly to extract arbitrary substrings. 
Prom the properties of "part" , the defining axioms for the Ti -functions can be proven without 
difficulty in AID. Also, function symbols defined by ^CRN or rCRN in Ti can be defined in AID 
in a straightforward manner using Comprehension, and the defining axioms for these functions 
follows directly from the Comprehension axiom in AID. Finally, functions defined by TRN in Ti 
can be defined in AID using an inductively defined predicate A^'^'^'^ that uses its parameter p 
to keep track of a path through the recursion tree and that computes the appropriate substring 
of X and the appropriate function of z at each level using the predicates D. Moreover, the 
defining axioms of such a function symbol follow directly from the axioms for A in AID. 

Now, for every formula B in AID, we let B denote the formula in QTi obtained from B by 
replacing each primitive function symbol by its definition in Ti and each inductively defined 
predicate symbol A by its definition ^ in Ti. Similarly, for every formula B in QTi, we let 
B denote the formula in AID obtained from B by replacing each function symbol of Ti by its 
definition in AID. 

These translations allow us to show that if a formula B is provable in AID + Sq-CA, then 
QTi can prove B (since QTi proves Sq-LIND and Eq-CA can be defined using CRN and proven 
using NIND), and that if a formula B is provable in QTi, then AID + Sq-CA can prove B 
(since AID + Sq-CA can prove NIND using Comprehension, and TIND in a way similar to 
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Aral's proof of "tree induction"). 

Hence, AID is equivalent to QTi, which implies that AID is conservative over Ti. Moreover, 
since Aral proves in his paper that AID is equivalent to QALV for S^-formulas, this also implies 
that ALV is equivalent to Ti (modulo the translations from numbers to strings and from strings 
to numbers given above). Unfortunately, the corresponding result for ALV' and Ti is not known. 

Now, even though AID is conservative over Ti (through the appropriate translations between 
strings and numbers), Ti appears to be more natural and easier to reason with for a variety of 
reasons. 

• Ti reasons directly with functions in FALOGTIME, whereas AID reasons only with 
predicates (functions have to be defined implicitly). 

• Ti's scheme of TRN is simpler than AID^s inductive definitions, in the sense that a func- 
tion defined by TRN carries out only simple computations at each level of the recursion 
(i.e., the h, h^, and hr functions can be computed by constant-depth circuits), unlike the 
computations requiring logdepth circuits carried out by the "D" predicates at each level 
of an inductive definition. 

• It seems quite tedious to work out precise estimates on the size of J^-proofs for the 
propositional translations of the Sj-theorems of AID, whereas the corresponding task for 
Ti is straightforward (so in a sense, Ti is "closer" to .F-systems than AID). 

6.2 Ti and PV 

Based on a similar result of Buss pJJ, which uses Krajicek, Pudlak & Takeuti's Herbrand-type 
witnessing theorem [21], Cook [T^ has argued that if QPV is conservative over QALV , then 
P = ALOGTIME, where QPV is the appropriately defined quantified theory corresponding 
to PV and QALV is a quantified theory based on Clote's ALV' . A similar result should hold 
with QTi in place of QALV, given a suitable interpretation of strings as numbers and numbers 
as strings. 

As for the quantifier- free theories PV and Ti, it is easy to see that if PV is conservative 
over Ti (through an appropriate translation between numbers and strings, such as the one used 
above), then Ti proves the soundness of eJ- (since PV proves the soundness of eT and eJ^ can 
be defined in Ti), so that T p-simulates eT. Unfortunately, the converse is not known, and 
this has no known implication for the complexity classes P and NC^. 



Chapter 7 



Conclusion 



7.1 Summary 

As Chapter [2] has shown, Li is an elegant and natural recursive characterization of NC^: 
simple functions are easy to define and even for more complex functions, the definitions are 
not unnecessarily complicated. The only exception to that statement might be the "numerical" 
functions, but even there, the definitions are straightforward and correspond quite closely to the 
computation of these functions by circuit families. Also, our scheme of TRN seems to capture 
the computational power of ALOGTIME in the most natural way, as evidenced by the short 
proof that FALOGTIME C Li. It would be interesting to prove the other direction also (that 
Li C FALOGTIME) by using computations by ATMs as opposed to uniform circuit families, 
so that both directions of the proof are similar, but time constraints did not allow us to carry 
out such a proof. 

The theory Ti based on Li has the desirable property that its appropriately translated the- 
orems have short ^-proofs, and the proof of that fact is quite simple (especially when compared 
to the corresponding proofs for other theories of ALOGTIME reasoning in the literature). In 
fact, the structure of the JT-proofs is straightforward enough that we get precise estimates on 
their size (as a function of the lengths of the variables). Also, considering the inherent com- 
plexity of evaluating Boolean sentences in ALOGTIME ^ the Ti-proof of the soundness of a 
particular .F-system is straightforward, consisting mainly in the formalization of Buss's BSVP 
algorithm and the proof of its properties. Finally, the fact that J- p-simulates any proof system 
S whose soundness can be proved in Ti is also straightforward to prove, and Ti is the first 
theory of ALOGTIME reasoning for which this result has been shown. All these facts strongly 
support our claim that Ti is one of the most natural theories available for ALOGTIME rea- 
soning, even though it is based on strings instead of numbers (unlike most of the other theories 
for polytime or ALOGTIME reasoning). 
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To conclude, it might seem that any algebraic characterization of a complexity class could 
be used to define a quantifier- free theory like Ti, by simply having function symbols defined 
recursively and induction rules based on the recursion operators. Although such a theory would 
undoubtedly reason on functions in the desired complexity class, we would still need to show 
that the type of reasoning that can be carried out in this theory also falls within the desired 
complexity class, and there is no clear way of doing this for arbitrary complexity classes. Also, 
as evidenced by Clote's theories ALV and ALV', it is not an easy task to get a theory that is 
natural and simple enough to be useful in practice. 

7.2 Future work 

First, an obvious generalization of Lq and Li suggests itself: for i > 1, let Lj+i be the closure 
of Li under COMP, CRN, and TRNI^'"''^, defined recursively. A study of these classes (or 
of a similar extension for the theory Ti) might be interesting. (One fact about Li which is 
relatively easy to prove is that it is a subset of the class of functions computable by uniform 
circuit families of ©(log*) depth, but it is unknown if this is a proper containment. It might 
be interesting to try to prove better results, maybe that the Lj's exactly captures these circuit 
families, or that the union of the Lj hierarchy defines the class of functions computable by 
uniform circuit families of polylog depth.) 

Next, it would be interesting to compare QTi to Takeuti and Clote's TNC^, maybe to show 
that the two first-order theories are equivalent. Also, from Aral's results on AID and ALV, we 
have concluded in Chapter [S] that Ti and ALV are equivalent, but it is unknown whether or 
not ALV' is equivalent to ALV (or to Ti). 

It would also be interesting to see if "tree recursion" and "tree induction" can be adapted 
to define a similar quantifier-free theory for uniform TC^ reasoning, hopefully as natural as 
Ti is natural for ALOGTIME reasoning (such a theory would correspond to bounded-depth 
.7^-systems with threshold gates in the same way that Ti corresponds to .F-systems). Note that 
there is already a first-order theory ^2 for TC^ reasoning, defined by Johannsen |22j . 

Similarly, there should be a way to extend Lq and Li to capture all of NC (based on Bloch's 
characterization of NC). This could possibly be used to define a quantifier- free theory for NC 
reasoning, which might lead to a natural propositional proof system that reasons in NC. 

Finally, fully relating conservativity results between logical theories for P and ALOGTIME 
reasoning to equivalence results between eJ- and J- systems to collapse results between P and 
NC^ remains a central open problem in this area. 
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Details of Proofs in the Formal 
Development of Ti 

This appendix contains most of the proofs missing from the formal development of Ti given in 
Chapter [3l It is included here mainly for the sake of completeness, so the style will be quite 
terse. In particular, most proofs that consist only in a straightforward application of NIND will 
be omitted. 

On generalizations of NIND 
Claim 13.2.541 

1. (L) z\>yx = z\>y-{z<]y)\>x (R) xy<\z = x<\{y\>z)-y<\z 

2. (L) y<\> X = \b{x,y) ■ y \> X (R) x <i >y = x <1 y ■ rb{x,y) 

3. (L) y \> {{x <} y) [> x) = e (R) {x <i {y [> x)) <} y = e 

4. (L) \c{rc{x,y),y) = rc{x,y) (R) rc{\c{x , y) , y) = \c{x , y) 

5. (L) \y\>x) =y>x'?^^ [e,{x<i>{y\>x)y) (R) {x <iyy = x 'Qy'?^^ [e,\{x <iy)<\> x)) 

6. (L) \c{x,yi) = \c{x,y) ■ \b{x,yi) (R) rc{x, iy) = rb{x, iy) ■ rc{x,y) 

7. (L) \c{x,y) ■ y \> X = X (R) x = x <\ y ■ rc{x,y) 

Proof 

1. (L) By NIND on z, Axioms Sc] and [5cl and Claim I3.2.32t e\>yx = yx = e\>y-e\>x = 
£ > y ■ {e <} y) > X, iz [> yx = >{z > yx) = >{z \> y ■ (z <\ y) [> x) = z [> y (>((-z <y) > 
x), >{z \> y) ■ {z <\ y) \> x) = z > yl^^ [e ■ {i ■ z <y) > x,iz \> y ■ e \> x) = z \> yl^^ [iz \> y ■ 
{iz <\ y) \> X ,iz \> y ■ {iz < y) \> x) = iz \> y • {iz < y) \> x. 
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2. (L) By NIND on y: eOx = x = e-x = lb(x, e)-el>x, {yi)<>x = yt>x = ^{y\>x)->{y>x) = 
^{{yi)< \> x) ■ {{yi) > x) = \b{x,yi) ■ {yi) \> x. 

3. (L) By NIND on y, and preceding claims: e \> ((x < e) \> x) = x \> x = e, 

yi > {{x < yi) t> x) = >{y t> {{x <\ y)< > x)) 

= y> >(lb(x, x <y) ■ {x <y)> x) 
= y > {{x <\y) > x) = e 

(Note that in this proof, we did not exphcitly deal with the cases when x <\y = e ox when 
[x <\ y)<[> X = e. However, it can easily be seen that both cases make the statement 
trivially true.) 

4. (L) By preceding claims: lc(rc(x, y), y) = ((x<ly) Ox) <\{y\> ((x<ly) Ox)) = rc(x, y)<\£ = 
rc(x,y). 

5. (L) By NIND on x and preceding claims: ^{y \> e) = e = y \> e 1^^ (s,^), > xi) = 
y > xi {e,\y O X • i)) = y t> xi (e,y O x (i,Xy O x))) = y t> xi (e,y O x 

[i, (x < >{y O x))')) = y \> xi (e, {xi < (y O x))') = y > xi (e, {xi < >(y o xi))'). 

6. (L) By preceding claims: lc(x, yi) = x < (yi O x) = x <1 >(y Ox) = x < (y O x) • y O x 
(e, (x <1 >(y O x))') = X < (y O x) • Xy o x) = lc(x, y) • lb(x, yi). □ 

On prepositional reasoning 
Theorem 13.2.61 

1. R^Bx = 1 V f«^x = 

2. «^^R^Bx = «^x 

3. = 1 ^ ^(s:i^X = 1) 

4. X y = 1 ^ {k.^x = 1 a Ri^y = 1) 

5. X y = 1 ^ (s:i^x = 1 V f^^y = 1) 

6. X y = 1 ^ {^.^x = 1 ^ Ri^y = 1) 

7. x y = 1 ^ («^x = 1 ^ ^ 
S. X 0^ y = 1 ^ {k.^x = 10 f»-^y = 1) 

Proof The first three can be proved by straightforward NIND on x, all the others with a 
simple and direct application of Derived Rule 13.2.31 □ 
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On "AND" and "OR" 

Lemma 13.2.561 

1. (L) x^eAx^OAx^l^ {>x ■ j)< = >{x<) ■ Vx 
(R) x/eAx/OAx/1^ ►(>x • j) = »-x ■ j 

2. (L) ^2;A»AND(>x-j) = AND(x) A«j (R) j A« AND(x) = AND(j • x<) A« x' 

3. 'x A^ AND(>x) = AND(x) = AND(x<) A^ x' for x / e, 0, 1 

Proof 

1. (L) By Derived Rule l3.2T]l since e = e, the statement is trivially true for x = e. Assuming 
ix 7^ A zx 7^ 1, i.e., x ^ e, 

(>(ix) ■ j)< = {xj)< 

= X (x-^,x-^ • \^x ■ j)) 

= X (x-^,x-^ • Vx) 

= X {x« ■ xW, >{i ■ x<) ■ Vx) 

= X {>{i ■ x<)< ■ \{i ■ x<y ■ ►x), >{i ■ x<) ■ Vx) 

= >{{ix)<) ■ V(ix). 

(A similar proof shows the same theorem with z • x< in place of >x • i.) 

2. (L) By the preceding lemma and by TIND on x: A^ AND(>e • j) = = AND(e) A^ j, 
^iA^AND(>i-j) = iA^AND(j) = AND(i)A^i, and assuming that \x<^)A^AND(>(x<^)-j) = 
AND(x<^) A^j and'(^x) A^AND(>(^x)- j) = AND(^x) A°j, then, for x such that >x / e 
(the case when it is equal being trivial), 

'x A^ AND(>x • j) = 'x A^ AND((>x • A^ AND(^(>x • j)) 

= \x<) A« AND(>(x^) • Vx) A^ AND(>^x • j) 
= AND(x^) A^ Vx A° AND(>^x • j) 
= AND(x^) A^ AND(^x) A« j 

= AND(x) A^j. 

3. By preceding lemmas and by Derived Rule l3.23] on x: ^ij) A^AND(>(ij)) = iA^AND(j) = 
AND(ij) = AND(i) A^ j = AND((ij)<) A^ (ij)', \ikj) A« mD{>{ikj)) = i A« AND(fej) = 
AND{ikj) = AND{ik) A^ j = AND{{ikj)<) A^ (ikj)' , and under the induction hypotheses 
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thatXx-^)A«AND(>(x^)) = AND(x) = AND((x^)<) A«(x^)' and ^^x) A«AND(>(^x)) = 
AND(a;) = AND((^2;)<) (►x)' for x< >^ 1, then 

'x A^ AND(>x) = V A^ AND((>x)^) A^ AND(^(>x)) 

= X {\x<) A" AND(>(x^)) A^ AND(^x), 

\x<) A^ AND(>(x^) • '(►x)) A« AND(>(^x))) 
= x?^^ (AND(x^) A^ AND(^x),AND(x^) A^X^^^) A^ AND(>(^x))) 
= X (AND(x^) A^ AND(^x), AND(x^) A« AND(^x)) 
= x?^^ (AND(x),AND(x)) 
= AND(x) 

= x?^^ (AND(x),AND(x)) 

= X (AND(x^) a'' AND(^x), AND(x^) A^ AND(^x)) 

= X (AND((x^)<) A^ (x^)' A^ AND(^x), AND(x^) A^ AND(^x)) 

= X (AND((x^)<) A^ AND((x^)' • (►x)<) A^ (►x)', 

AND(x^) A^ AND((^x)<) A^ (►x)') 
= AND((x<)^) A^ AND(^(x<)) A^ x' 
= AND(x<) A^x'. □ 

On generalizations of CRN — part I 
Theorem 13.2.91 jX = jy*-^x\>y = e = x<\y 

Proof By Derived Rule l3.2.3l jS = jV^e = y^£>y = e = £<iy, jx = jE^x = e^xOe = 
e = X <\e, j{xi) = j{ky) <-^jX-j = j-jy^jX = jy<^x\>y = e = x<\y-<-^xi\>ky = e = xi<\ ky. 
□ 

Claim I:-{.2.57I For f = r CRN[h] , 

1- j{f{x,y)) = jX 

2. f{x,y) <\z = f{x<\ z,y) 

3. Ib(/(x, y), z) = X [> z ((0 • /i(lc(x, z), y))', e) forx,z^e 

4. \c{f{x,y),z) = f{\c{x,z),y) 

Proof 

1. By NIND on x: j{f{e,y)) = je, j{f{xi,y)) = j{f{x,y) ■ (0 • h{xi,y))') = jx ■ j = j{xi). 
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2. By Derived Rule|3.2.3[ /(e, y) < z = e < z = f{e<lz,y), f{x,y)<is = f{x,y) = f{x<ie,y), 
f{xi,y) < zj = {f{x,y) ■ (0 • h{xi,y)y)<<i z = f{x,y) <\ z = f{x <\ z,y) = f{xi <\ zj,y). 

3. First, a straightforward proof by NIND on x shows that ^f{x,y) = (0 • hQx,y)y. Now, 
by NIND on z ^ £ and the claims above: lb(/(x,y),j) = ^(e > f{x,y)) = (0 • h(^x,y))' = 
(0- Mlc(x, j),y)y, 

lb(/(x,y),zi) = \z> f{x,y)) 

= z > fix, y) (e, (/(x, y) < Hz > f{x, y)))') 

= z>x?''{e,{f{x,y)<}>{z>x)y) 

= z> x?^^ {e,f{x < >{z > x),y)') 

= z[> x?^^ {e,f{x < (z > x) • \z > x),y)') 

= z> x?^^ (e, (0 • h{x <i{z\> x) ■ \z > x), y))') 

= x\>zi ((0 • /i(lc(x,z) • lb(x,zi),y))',e) 

= x>zi {{{)-h{\c{x,zi),y))\e). 

4. By NIND on lc(/(x, y), e) = e = /(lc(x, e), y), and 

lc(/(a;,y),zi) = lc(/(x,y),z) • \h{f{x,y),zi) 

= /(lc(x, z), y) • X > ((0 • /i(lc(x, zi), y))', e) 

= z>xr^ (/(lc(x, z), y) • e, /(lc(x, z), y) • (0 • /i(lc(x, zi)M') 

= z > X (/(lc(x, zf), y), /(lc(x, zi),y)) 

= f{\c{x,zi),y). □ 

Lemma 13.2.581 

1. (L) y [> X = X > y (y [> X, e) (Rj x < y = x > y (x < y, e) 

2. (Lj (y < (x > y)) [> X = y > X fRj x < ((y < x) [> y) = x < y 
Proof 

1. (L) Immediate from the fact that xt>y^e^yt>x = £. 

2. (L) By NIND on y: (e < (x > e)) > x = (e < e) > x = e [> x, 

(yj <] (x > yj)) [> X = X [> yj T''^ ((yj < e) > x, (yj < ((x t> y) ■ j)) \> x) 
= x\>yj l^"- {yj > X, (y < (x [> y)) > x) 
= x\>yj 1^^ {yj \> x,y > x) = x\> yj 1^^ {yj t> x, e) 
= x\>yj T""^ (yj [> X, yj > x) = yj > x. □ 
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Claim 13.2.591 

1. (L) \pj{x,max^{x,y)) = \pj{x,y) (R) rpj{x,max^{x,y)) = rpj{x,y) 

2. (L) lb(lpo(xi, max^(xi,yi)), max^(xi,yj)) = i 
(R) rb(rpo(ix, max^(ix, jy)), max^{ix,jy)) = i 



1. (L): \pj{x, max^(x, y)) = x\>y?^^ [j{x <i x) ■ x, j{y <i x) ■ x) = x\>y (e • x,j{y <ix) -x) = 
x\> yl^^ (^j{y <\ x) ■ X, j{y <\ x) ■ x^ = j{y <\ x) ■ x = \pj{x, y). 



lb(lpo(xi, max^(xi, yj)), max^(xi, yj)) 
= lb(lpo(x,max^(x,y)) ■i,max^{x,y) ■ x> yl^"^ 
= ^(max^(x, y) > (o(max^.(a;, y) <\x) ■ x ■ i)) 
= x>yl^^ (^(x > (o(x < x) ■ xi)),^{y \> (o(y < x) ■ xi))) 
= x>y?^'' {\x > (e • xi)),\y [> (x > ov) ■ (y < (x > oy)) > xi)) 
= X > y {\, \e • (y < (x > y)) <] X • i)) 
= X > y {i,\y > X -i)) 

= x>y?^'' {i, \e -i)) =x>y l^"- {i, i) = i □ 



Theorem 13.2.101 

iCRNm[h]{xi, ... ,Xm,y) 

= xx- ...-Xml^^ (e/(/i(lpo(xi, max^(xm)),. . . ,lpo(xr„, max^(xm)),y) • O) 



• ^CKJVmW (>lpo(a;i, max^(xm)), . . . , >lpo(a;m, max^(xm)), y^ 



rCJ?JVm[/i](xi, . . .,Xm,y) 
= xx- ...-Xml^^ (e,rCJ?iVm[/i](rpo(xi, max^(xm))<, . . . , rpo(xrri, max^(xm))<, y) 



• (0 • /i(rpo(xi, max^(xm)), . . . , rpo(xm, max^(xm)), y' 



Proof From the fact that max^(xm) — E <— > x\ ■ . . . • Xm — £ (easily proved by Derived 
Rule I3.2.3P , the theorem follows from Claim 13.2.591 by a straightforward application of Derived 
Rule 13.2.31 generalized to m variables. □ 



Proof 



2. (L): 
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Claim 13.2.601 

1. and^(xiii, . . .,Xmim) = and^(xm) • and^(zm) 

2. jxi = ■■■ = jXm A jyi = • • • = jym and'^{xiyi, . . .,Xmym) = and^(fm) • and^(ym). 

3. and^(xm)< = and^(xi<, ...,Xm<) 

4. and^(xm) <y = and^(xi <]y, . . . ,Xm <y) 
Proof 

1. By a straightforward application of Derived Rule I3.2T31 generalized to m variables. 

2. Again, by a straightforward application of Derived Rule l3.2T3t generalized to m variables, 
together with the previous claim. 

3. From the first claim, by a straightforward generalized NIND. 

4. Directly from the preceding claim, with a straightforward NIND on y. □ 

Theorem [Mm] ^^AND(x) = OR(not«(x)) and ^«OR(x) = AND(not«(2;)) forx^e 

Proof (We prove only the first statement, the second one being almost identical.) By TIND 
on x: ^°AND(i) = = 0R(^^«) = OR(not^(i)), 

^^AND(x) = ^^(AND(2;-^)A^AND(^x)) 
= ^^AND(x^)V^^^AND(^x) 
= OR(not^(x-^))V«OR(not«(^x)) 

= OR(not^(x)^)V«OR(^not^(x)) = OR(not«(x)). □ 

On generalizations of CRN — part II 
Claim 13.2.611 

1. ►pow^(y) = pow^(^2/) = po\N^{y)< (for y ^ e,0,l) 

2. ipow'(y) = pow'(iy) = pow^(y) 

3. pow^(pow^(y)) = pow^{y) 

4. pow^(y) \> y = £ 

5. ispow^(pow^(y)) = e 
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Proof All can be proved by very simple applications of TIND. We give the proof of the third 
statement as an illustration: pow^(pow^(e)) = pow^(e), pow^(pow^(i)) = pow^(l) = pow^(i), 
pow^(pow^(2/)) = pow^(^pow^(y)) • pow^(^pow^(?/)) = pow^(pow^(^y)) • pow^(pow^ (►?/)) = 
pow^(^y) • pow^(^y) = pow^(y). □ 

Claim 13.2.621 

1. e#y = e = x#e 

2. (L) ,{xi #y) = lix # y) • ly (R) i(x # yi) = i(x # y) • ix 

3. (L) ■y)#z) = i(x # z) ■ i(y # z) (R) i(x # (y • z)) = i(x # y) • i(x # 

4. i(x#y) = i(y#x) 

Proof Again, all these statements can be proved by very simple applications of NIND or 
TIND; we give the proof of the third statement (for the left case) as an illustration: i{{x-£)^z) = 
i{x#z) = i{x#z)-i{e#z), i{{x-yi)#z) = i{{x-y)#z)-iz = i{xi^z)-i{y#z)-iz = i{x#z)-i{yi#z). 
□ 

Claim 13.2.631 

1. powdiv^(x, y) = powdiv^dx, ly) 

2. powdiv^(x,y) = powdiv^(a;, pow^(y)) 

3. x> pow^(y) ^ £ —>■ powdiv^(x, y) = £ 

4. powdiv^(pow^(y) #z,y) = y (e, iz) 

Proof 

1. By a simple TIND on y. 

2. By a simple TIND on y. 

3. First, we prove that x [> pow^(y) £^ x< > pow^(^y) 7^ e by proving the contrapositive 
by TIND on y: x\> pow'(e) = £ (so the statement is vacuously true), x< > pow^(^l) = 
£ — > x< T^e— >x/e^xl> pow^(l) = e, and assuming that x< > pow^(^y) = e, then 

x\> y = {x< ■ ►x) > {po\N^{y)< ■ ►pow^(y)) 
= ►x > (x-^ > (pow^(^y) • pow^(^y))) 
= ►x > {{x< > pow^(^y)) • (x-^ < pow^(^y)) > pow^(^y)) 
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= ►x > {{xM < pow^(^7/)) > pow^(^y)) 
= {x< <i pow^(^y)) > (►x D> pow^(^y)) 
= {x< <\ pow^(^y)) \> e = e 

(where we have used the fact that (z ■ y) !> x = y \> (z \> x) = z > {y \> x), which is easy 
to prove by NIND on z). The result then follows by a simple application of TIND. 

4. By TIND on y: powdiv^(pow^(e) ^ z,e) = e = e (e, i-z), powdiv^(pow^(i) # = 
powdiv^(z, i) = iz = i?^^{e,iz), powdiv^(pow^(y) y) = powdiv^((pow^(y) ►y) = 

powdiv^(pow^(y)<^ # z, ►y) = powdiv^(pow^(^y) # z, ►y) = iz = y (e, iz). □ 

Claim 13.2.651 

1. x> (pow^(y) 7^ powdiv^(x, y)) = e 

2. y ^ e ^ powmod^(x, y) \> pow^(y) 7^ e 

3. y ^ e ^ powdiv^(xl, y) = powdiv^(x, y) ■ ((powmod^(x, y) • 1) O pow^(y) (1, e)) 
y ^ e ^ powmod^(xl, y) = (powmod^(x, y) • 1) 1> pow^(y) (e, powmod^(x, y) • l) 

4. powdiv^((pow^(y) z) ■ x,y) = y (e, iz) • powdiv^(x, y) A 
powmod^((pow^(y) z) ■ x,y) = powmod^(2;, y) 

5. y e A X > pow'(y) = e ^ powdiv^(x, y) = powdiv^(x < pow'(y), y) • 1 
y ^ e A X > pow^(y) = e ^ powmod^(2;, y) = powmod^(x <l pow^(y), y) 

Proof 

1. By TIND on y (with hi = hr = •^): x > (pow^(e) # powdiv^(x, e)) = x\>e = e, x\> 
(pow^(i) # powdiv^(x, i)) = X > (1 # ix) = e, 

X [> (pow^(y) # powdiv^(x, y)) 
= X > (pow^(y) 7^ powdiv^(x-^, ►y)) 

= {x< ■ ►x) > ((pow^(^y) • pow^(^y)) # powdiv^(x-^, ►y)) 
= ►x [> (^(x-< > (pow^(^y) # powdiv^(x-^, ►y)))- 

(x-^ < (pow^(^y) # powdiv^(x-<, ►y))) > (pow'(^y) # powdiv^(x-^, ►y))^ 

= (x-4 < (pow^(^y) # powdiv^(x-^, ►y))) > (►x [> (pow^(^y) # powdiv^(x-^, ►y))) 
= (x-^ < (pow^(^y) # powdiv^(x-^, ►y))) > e = e. 



118 



Appendix A. Details of Proofs in the Formal Development of Ti 



2. By TIND on y / e: powmod^(x, 1) > pow^(l) =e>l^e, 

powmod^(a;, y) > pow^(y) 
= ((pow^(2/) # powdiv^(x,y)) D> ix) > pow^(y) 
= ^((pow^(^?/) • pow^(^y)) # powdiv^(x-<, ►?/)) > • i^x)^ 

> (pow^(^?/) • pow^(^y)) 

= (^((pow^(^y) # powdiv^(x-^, ►y)) • (pow^(^y) # powdiv^(x-<, ►y))) 

> (ix-^ • ix< ■ {x (e, 1)))) > (pow^(^y) • pow^(^y)) 
= (^((pow^(^y) # powdiv^(x-^, ►y)) > ix-<)- 

((pow^(^y) # powdiv^(x-^, ►y)) > ix-< • (x l""- {e, 1)))) [> (pow^(^y) • pow^(^y)) 
= f powmod^(x-^, ►y) • {powvnod^{x<, ►y) • (x (e, 1))) j > (pow^(^y) • pow^(^y)) 



= powmod^(x-^, ►y) > pow^(^y) • (powmod^(x-<, ►y) • (x (e, 1))) > pow^(^y) 7^ e 

(where we have used the fact that y<\x = eAw<\z = e—i- wy \> izix = w \> iz ■ y \> ix, 
which is a direct consequence of Claim [3^.2.541 and the fact that zy [> x = y > (z > x)). 

3. We prove the first statement by TIND on y 7^ e: powdiv^(xl, 1) = ixl = ix • (1 > 1 
(l,e)) = powdiv^(x, 1) • ((powmod^(x, 1) • 1) [> 1?^^ (1, e)), and before proving the inductive 
case, we can show by TIND on y 7^ e that 

(powmod^(x,y) • 1) > pow^(y) (l,e) 

= X [((powmod^(x-<, ►y) • 1) > pow^(^y) • powmod^(x-^, ►y) > pow^(^y)) (l,e), 



((powmod^(x-<, ►y) • 1) > pow^(^y) • (powmod^(x-<, ►y) • 1) > pow^(^y)) (l,e)j 
= X (^e, (powmod^(x-^, ►y) • 1) > pow^(^y) T""^ (l,e)) 

(the proof is similar to that of the preceding claim) , so that 

powdiv^(xl, y) 
= powdiv^((xl)-^, ►y) 

= X (powdiv^(x-^, ►y), powdiv^(x-^ • 1, ►y)) 

= X (powdiv^(x, y), powdiv^(x-<, ►y) • (powmod^(x-<, ►y) • 1) > pow^(^y) (1, e)) 

= powdiv''(x,y) • (x T"""- (e, (powmod''(x-<, ►y) • 1) > pow^(^y) l^"" (l,e))) 
= powdiv^(x,y) • ((powmod^(x, y) • 1) \> pow^(y) (l,e)). 

As for the second statement, it follows directly from the first by the definition of powmod^ 
and the fact that powmod^(x, y) > pow^(y) ^ e ^ pow^(y) > (powmod^(x, y) • 1) = e. 
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4. By NIND on x: powdiv^((pow^(y) ^ z) ■ £,y) = y (e, iz) = y (e, iz) ■ powdiv^(£, y) A 
powmod^((pow^(?/) # z) ■ e,y) = e = powmod^(£, j/), 

powdiv^((pow^(y) 7^ z) • xi,y) 

= powdiv^((pow^(y) # z) • x,y)- 

((powmod^((pow^(y) # z) • • 1) > pow^(y) (l,^)) 

= {y r'^ {e, iz)) ■ powdiv^(x,y) • ((powmod^(x, y) • 1) > pow^(y) (l,e)) 

= {y (e, iz)) • powdiv^(xl, y), 
powmod^((pow^(y) # z) • a;i, y) 

= (pow^(y) # powdiv^((pow^(y) # z) ■ xi,y)) D> i((pow^(y) # z) • xi) 

= (pow^(y) # ((y (e, iz)) ■ powdiv^(.x?; y))) > i(pow^(y) # z) • i(.x?;) 

= (pow^(y) # (y (£, iz))) ■ (pow^(y) # powdiv^(xz, y)) > (pow^(y) # iz) • i{xi) 

= pow^(y) # powdiv^(xi,y) > i(xz) 

= powmod^(a;i,y). 

5. By NIND on x: y 7^ e A e t> pow^(y) = £ is false so the statement is vacuously true, and 
y ^ e Axi> pow^(y) = £^(y^£Aa;> pow^(y) =£)V(y^£Aa;> pow^(y) = 1), so we 
prove the statement by cases. 

First, if y 7^ £ Axl> pow^(y) =e (which implies that i(xl < pow^(y)) = < pow^(y)) • 1), 
powdiv^(xl, y) 

= powdiv^(x, y) • ((powmod^(.T, y) • 1) [> pow^(y) 

= powdiv^(a; < pow^(y),y) • 1 • ((powmod^(x < pow^(y)) • 1) > pow^(y) (1,£)) 
= powdiv^((x < pow'(y)) • l,y) • 1 = powdiv^(xl < pow^(y),y) • 1, 
powmod^(xl,y) 

= (powmod^(a;,y) • 1) > pow^(y) (£, powmod^(a;, y) • 1) 

= (powmod^(x < pow^(y),y) • 1) > pow^(y) (s, powmod^(.x' < pow^(y),y) • 1) 
= powmod^((a; <] pow^(y)) • l,y) = powmod^(xl O pow^(y),y). 

Second, if y 7^ £ A a; [> pow^(y) = 1 (which implies that i(xl < pow^(y)) = e), 

powdiv^(a;l, y) = powdiv^(a;, y) • ((powmod^(a;, y) • 1) l> pow^(y) (l,e)) 
= £ ■ ((ixl) > pow^(y) 
= 1 = powdiv^(£,y) • 1 
= powdiv^(a:;l < pow^(y),y) • 1, 
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powmod^(xl, y) 



(powmod^(x, y) • 1) \> pow^(y) ? 



(e, powmod^(x, y) ■ 1) 



ixl > pow^(y) (e, ixl) 



e = powmod^(e, y) 



powmod^(xl < pow^(j/),y). 



□ 



On "=^" and "<^" 
Theorem 13.2.131 

1. X =^ e ^ X = 

2. x=^ y^ lpo(x, y) = lpo(y, x) 

Proof 

1. By a straightforward NIND on x. 

2. By the preceding property and Derived Rule 13.2.^ x =^ e*-^x = oa;^ lpo(2;, e) = \poi£, x) 
(and similarly for e y), xi yj ^ x =^ y A^ {i j) ^ Ipo(a^) u) = ^Poiu^ x) A i = 
j ^ lpo(x, y) • i = lpo(y, x) • j ^ IPo(2;^, yj) = \Poiyj, xi). □ 

Claim 13.2.671 

1. xO =~ yO <-> X y <-> xl yl 

2. ^B(xO=~yl) 

3. ^B(xl =~ yO) 

Proof Directly from Theorem 13.2.131 xi =^ yj ^ \pQ{xi,yj) = lpo(yi, xz) ^ lpo(x,y) • i = 



\po(.y^x) - j ^ ^Poix,y) 


= IPo(2/> 


x) A i = 


j ^ 


X = 


y Ai 




Claim 13.2.681 














i. xO <~yO = X <~ 


y V^ (x 


=~yA« 


0<^ 


'0) 


= X <~ 


y 


2. xO yl = X <~ 


y V^ (x 


=^ y AB 


<^ 




= X <^ 


y 


3. xl <~ yO = X 


y V^ (x 


=~y A« 


1 


^0) 


= X <^ 


y 


4. xl <^yl = X <~ 


y V^ (x 


=~y A« 


1 


^1) 


= X <^ 


y 



Proof From Theorem 13.2.101 we have that xi yj = x <~ y V^ (x y A^ i <^ j), so the 
theorem follows directly from the preceding claim. □ 



On "=~" AND "<~" 



121 



Claim 13.2.691 

1. -«(x <~e) 

2. <~ x) 

3. -«(e <^ ox) 

Proof A straightforward proof by NIND, using the preceding claims. □ 

Lemma 13.2.721 

1. X =^ y = Ox =^ y = X Oy = Ox =^ Oy 

2. X y = Ox <~ Oy 

3. X <'^' y = Ox y = X <'^' Oy 
Proof 

1. Direct from the fact that (lpQ(Ox,y) = lpo(x,y)) V (lpo(Ox,y) = • lpo(x,y)) (which can 
easily be proved by cases depending on the length of x > y), and by Derived Rule l3.2THl 

2. By a simple application of Derived Rule 13.2.31 and the first claim (we show only the 
inductive step, the base cases being just as simple): Oxi Oyj = Ox Oy (Ox 

Oy i <^ j) = X <^ y (x =^ y i <^ j) = xi yj. 

3. By the second claim: 

Ox y = Ox < y (o(y < Ox) • Ox <~ y, Ox <~ o(Ox < y) • y) 

= X > y (Ox <~ • o(a; < y) • y, o(y < x)< • • x <~ y) 
= X > y T"""- (x o(x < y) • y, o(y < x) • x <~ y) 
= x<~y 

(and similarly for x <^ Oy). □ 

Theorem 13.2.151 x =~ y A y <~ z ^ x z and x y Ay z ^ x z 

Proof Follows directly from the (already proven) facts that x y ^ lpo(2;) max^x, y, z)) = 
lpo(y, max^(x, y, z)) and y <~ z ^ lpo(y, max|(x, y, z)) <^ Ip^iz, max^(x, y, z)). □ 

Corollary 13.2.731 x =~ y A y <^ z ^ x <^ z and x y Ay z ^ x z 

Proof Directly from the theorem. □ 
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Theorem 13.2.161 x y A y <^ z ^ x z and x >^ y Ay >^ z ^ x >^ z 

Proof By Derived Rule I3.2.3t the base cases for z = e and y = £ are trivially true since 
e x, while the third base case e <^ y Ay <^ 2— >e z is itself proved by Derived Rule 13.2^31 
the two base cases are again trivially true, and e <''^ A^yj <^ zk = [e <^ y\J^ {e =^ y A^0<^ 
i)) A« {y <^ z (y =~ z A« j <^k)) = (e <^ y y <~ z) V« (e <~ y A« y =~ z A^ j A;) V« (e =~ 
yA«0<«jA«y <~ z)V«(e yA«0<«jA^y =~ zA^jK^k) = e <~ zV«(e =~ zA^O<«A;) = e <~ zk 
(the general inductive step is almost identical). □ 

Corollary 13.2.741 x <~ y A y z ^ x <~ z and x y A y >~ z ^ x >~ ^; 
Proof Directly from the theorem. □ 

Corollary 13.2.751 x <~ y A y <~ z ^ x <^ z and x >^ y A y >^ z ^ x >^ z 
Proof Directly from the theorem. □ 

On "I- 1" and "suce" 

Claim \3.'2.7^\ 

succ'^(e) = 1 
succ'"'(xO) = 0x1 
succ'^'(xl) = succ'^'(x) • 

Proof Simple proofs by NIND (proving the relevant properties first for the auxiliary func- 
tion cuss*^, and then for succ'"^). □ 

Claim 13.2.771 

succ'^(Ox) = • succ'''(x) 

succ~(lx) = 'succ~(x) • -«'succ~(x) • >succ~(x) 
'succ~(x) = AND(lx) 

succ~(x) = AND(lx) ?^ (1 • ox,0 • >succ~(x)) 

Proof All the proofs are simple, but we will illustrate them by proving the second property, 
by NIND on x: succ~(le) = 10 = 1 • ^^1 • >1 = 'succ"*'('^) ' -'''succ~(e) • >succ''(e), succ''(lxO) = 
01x1 = X0xl)-^«\0xl)->(0xl) = 'succ~(xO)-^«'succ~(xO)->succ~(xO), succ~(lxl) = succ~(lx)- 
= ^succ~(x) • ^«^succ~(x) • >succ^(x) • = \succ~(2;) • 0) • ^^Xsucc^(x) • 0) • >(succ~(^) • 0) = 
'succ~(xl) • ^^^succ"'(xl) • >succ~(xl). □ 



On "H" and "succ'^" 123 
Theorem 13.2.171 

1. X <^ succ^(x) 

2. X >^ y = X >~ succ''(y) 

Proof 

1. A simple direct proof by NIND: e succ^(e) = e 1, xO <^ succ^(xO) = xO <^ 0x1 = 
X <^ Ox, xl succ^'(xl) = xl <'^' succ^'(x) • = X <''^' succ''^'(x). 

2. By Derived Rule[3X3l e >~ y = = e >~ succ~(y) (since succ~(y) e), x >^ e = x >^ 
succ^(e) (proved by an easy NIND on x), 

xi >~ yO = X >^ y (x =^ y i >^ 0) 
= X y (x y i) 

= X >~ Oy (x Oy A^ i >" 1) V« (x =~ Oy A^ i) 
= xi Oyl xi =~ Oyl 
= xi Oyl = xi >~ succ~(yO), 
xi >^ yl = X >~ y (x =~ y A« i >^ 1) 
= X > y 
= X >~ succ~(y) 

= X >~ succ~(y) V« (x =~ yA^i>^ 0) 

= X >~ succ'^(y) V« (x =~ y A^ i 0) (x =^ y A« i =^ 0) 

= xi succ'''(y) • xi =~ succ''(y) • 

= xi >~ succ~(y) . = xi >~ succ^(y 1) . □ 

Claim 13.2.781 |x| = \jx\ 

Proof By TIND on x: \e\ = e = \je\, |i| = 1 = |1| = |ii[, [x| = x (l^-^l • 0, |x-^| • l) = 
jX ■ 0, \jX<\ ■ l) = \jx\. □ 

Theorem 13.2.191 |xi| =~ succ~(|x|) 
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Proof By TIND on x: \ei\ = 1 = succ~([e|), = 10 = succ^(l) = succ^(|j|), 



xi\ = xi 7'^'' (l(xi)-^l • 0, \{xi)M\ ■ l) 
= x?^^ (I(xi)^l ■l,\{xi)<\ -0) 
= 3;?''^ {\x<\ ■ 1, \x< •'(►x -i)! -O) 
= x?''^ • 1, \x< • j[ -0) 

=~x?^^ (succ~(|x^| •0),succ^(|x^|) -0) 
= x?^^ (succ'^'dx^l •0),succ~(|x^| • 1)) 
= succ^'(|x|) 



(where the fifth equahty, where "=^" is mtroduced, holds by the induction hypothesis). □ 



On "masking" functions 
Theorem i;^.2.2()I 



Proof By a straightforward NIND. □ 
Corollary 13.2.811 firsto(x) = firsti(not^(x)) 

Proof By a straightforward NIND, from the preceding theorem. □ 

On binary addition 

Theorem I8.1>.'ill x y = y x 

Proof Direct from the commutativity of the functions involved (i.e., xor^ and and^). □ 
Lemma [3X82] 



firsto(Ox) = I ■ qx 
firsti(Ox) = • firsti(x) 



firsto(lx) = • firsto(x) 
firsti(lx) = 1 ■ ox 



carry^(xO, 1) = carry'^(x, e) ■ = qxO 
carry''(xl, 1) = x (l, carry~(x, 1) • l) 
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Proof (We prove only the second statement, the first is a simple application of NIND on 
X.) By NIND on x: carry~(e • 1, 1) = 1 = e T^^- (l,carry~(e, 1) • 1), 

carry"'^(Oxl, 1) = • carry"'^(xl, 1) 

= {)-xr'- (l,carry~(x,l) • l) 

= x?^^(0-l,0-carry~(x,l)-l) 

= X (carry~(0, 1) • l,carry~(Ox, 1) • l) 

= carry~(Ox, 1) • 1, 
carry*'^(lxl, 1) = Varry'"^(xl, 1) • carry''^(xl, 1) 

= X (Xl) • 1, Xcarry~(2;, 1) • 1) • carry^(x, 1) • l) 

= X (11, Varry''(x, 1) • carry~(x, 1) • l) 

= X (carry~(l, 1) • l,carry^(lx, 1) • l) 

= carry~(lx, 1) • 1. □ 

Theorem 13.2.221 x +~ 1 = x (O • succ~(x) , succ~(x)) =~ succ''(x) 

Proof By NIND on x: e +~ 1 = 01 = e (O • succ^(e), succ^(e)) , 

xO +~ 1 = xorf (carry~(xO, 1) • 0, xO, 1) 
= xorf(OoxO,OxO,Ooxl) 
= 0x1 

= succ''(xO), 
xl +~ 1 = xorf (carry~(xl, 1) • 0, xl, 1) 

= xorf (carry~(x, 1) • 10, 0x1, OqxI) 
= xorf (carry~(x, 1) • 1, Ox, o2;0) • 
= xor3(carry'^(x, 1) • 0, Ox, o2;l) • 
= xorf (carry~(x, 1) • 0,x,l) • 
= (x +^ 1) • 
=~ succ~(x) • 
= succ~(xl). □ 
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Claim 13.2.831 For jx = jy, 

• ox 

xO 

• carry''^'(x, y) 
Varry^'(x,y) • carry''^'(a;, y) 

1 • carry''^'(x, y) 

Proof All these properties can be proved with a simple application of Derived Rule 13.2.61 
or directly from the definition of carry^'. The last three depend on the following facts. 

maskbit(andf (Ox, Oy), firsto(xorf (Ox, Oy))) = maskbit(0 • andf (x, y), firsto(0 • xorf (x, y))) 

= maskbit(0 • andf (x, y), 1 • oxorf (x, y)) 
= 

maskbit(andf (Ix, Oy), firsto(xorf (Ix, Oy))) = maskbit(0 • andf (x, y), firsto(l • xorf (x, y))) 

= maskbit(0 • and^x, y), • firsto(xorf (x, y))) 
= maskbit(andf (x, y), firsto(xorf (x, y))) 

maskbit(andf (Ix, ly), firsto(xorf (Ix, ly))) = maskbit(l • andf (x, y), firsto(0 • xorf (x, y))) 

= maskbit(l • andf (x, y), 1 • oxorf (x, y)) 
= 1 □ 

Claim 13.2.841 For jx = jy, 

X +^ e = Ox 
X X = xO 
Ox +~ Oy = • (x +~ y) 

Ix +~ Oy = Xx +~ y) • -^Xx +^ y) • >(x +~ y) 
Ix ly = 1 • (x +~ y) 

Proof Directly from the corresponding properties for carry"*, where we have used the fact 
that ^{x +''^'y) = ^carry'^Xx, y). Note that the second property implies that >succ'^(x +^'x) = xl. 



carry (x,e) = 
carry^(x,x) = 
carry'^XOx, Oy) = 
carry'^(lx, Oy) = 
carry''^'(lx, ly) = 
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Claim 13.2.851 



xO +~ yO 
xl +~ yO 
xl +~ yl 



{x +~2/)-0 
>succ''(x y) ■ 



Proof The first two properties follow directly from the Claim above by Derived Rule l3.2T6l 
We prove the third property because it is more involved. First, note that -i^AND(x+^y) can be 
proved directly from Claim [3^.2.841 by Derived Rule l3.2T6l This implies that ^succ''(x +^ y) = 0, 
which in turn implies that succ'"^(l • (x +^ y)) = 01 • >succ'^(x Now, we prove the third 

property by Derived RuleESSl 1 +~ 1 = 10 = >(01) • = >succ~(0) • = >succ^(e • 0, 

0x1 +~ Oyl 
= 0- (xl+~2/l) 
= • >succ''(x +~ y) • 
= succ~(x -0 
= >(0-succ~(x +''y)) -0 

= >succ'"*'(0 • • 

= >succ'''(Ox +~ Oy) ■ 0, 
1x1 +~ lyl 
= 1- (xl+~yl) 
= 1 • >succ^(x y) • 
= >(01 • >succ~(x -0 
= >succ~(l • (x+~y)) -0 

= >succ~(l^ ly) • 0, 
1x1 Oyl 

= Xxl +~yl) • -«\xl +^yl) ■ >(xl +^yl) 

= X>succ^(x +^ y) • 0) • -«X>succ~(x +~ y) ■ 0) ■ >(>succ^(x +^ y) ■ 0) 
= Vsucc~(x +~ y) ■ -«Vsucc~(x +~ y) ■ »succ^(x +~ y) ■ 

= Xx (Vsucc'^(l>(x • -^Vsucc'^(l>(x • »succ~(l>(x • 0, 



>succ~(0>(x +^y)) • -^Vsucc~(0>(x • »succ~(0>(x • 



(x + y)7 [-^ 



B\ 



'succ~(>(x +~ y)) • ^succ''(>(x y)) • >succ~(>(x +~ y)) • 0, 



^succ^(>(x y)) • -^^succ^(>(x +~ y)) • >succ~(>(x +~ y)) • 



128 Appendix A. Details of Proofs in the Formal Development of Ti 

= \x +^ y) (-«AND(l>(x +~ y)) ■ succ~(>(x +~ y)) ■ 0, succ~(l>(a; +^ y)) ■ o) 

= \x +~ y) (l ■ succ^'(>(a; +~ v)) ■ 0, succ~(l>(a; +'^' v)) ' o) 

= Xx +~?/) (>(01 • succ~(>(x •0,>(0-succ~(l>(x +^y))) -o) 

= \x +^ y) (^>succ^'(10>(2; y)) • 0, >succ~(01>(x +~ y)) ■ o) 

= >succ~C(^ y) ■ +~ y) ■ +~ y)) ■ o 

= >succ~(lx +~ Oy) • 0. □ 

Theorem 13.2.231 

J. X +~ succ~(y) =~ succ~(x +~ y) 

2. X +~ (y +~ z) =~ (x +~ y) +~ z 

3. y <~ z ^ X +^ y <~ X +^ z 

4. X y A z It; — > X +^ z y +^ u; 

5. X y A z Ti; — > X +^ z <~ y +^ w 

6. X y f\z <^ u; — > X z y +^ u; 
Proof 

1. By Derived Rule [MS e succ~(y) =^ succ^(y) =^ succ~(e+~y), x +^ succ~(e) = 
X +~ 1 succ~(a;) =~ succ~(x e), 

xO +~ succ~(yO) =^ xO +~ yl 
(x +"y)-l 

=^succ^((x +~y) -0) 

=~succ~(xO +~yO), 
xO +^ succ~(yl) =^ xO +~ succ^(y) • 

=^ (x +^succ%))-0 

=~ succ~(3; +'^' y) • 
=~succ~((x +~y)-l) 
succ~(2;0 +~yl), 
xl+^succ%0) =~xl+^yl 

=~ succ~(x y) • 
=~succ~((x +~y)-l) 
=~succ~(^l+~yO), 
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xl +~ succ~(yl) xl +~ succ~(y) • 
=^ (x +~succ~(y)) • 1 
=^succ^(x • 1 

succ'^(succ~(x -0) 
=^succ~(xl+~yl). 

2. By Derived Rule l3.2l3t generalized to three variables: £+^'{y+^ z) z =^ {e+^y)+^ z 

(and similarly for x, e, z and x, y, e), xO +^ (yO +~ zO) =^ xO +~ (y +~ 2;)0 =~ (x +~ (y +~ 
2))0 =~ ((x y) +~ z)0 (x +~ y)0 +^ zO =~ (xO +~ yO) zO, xl +~ (yO +~ zO) =~ 
xl+~(y+'^z)0 =^ (x+~(y+~z))l =~ ((x+^y)+~z)l =~ (x+~y)l+-^zO =~ (xl+^yO)+'^zO 
(and similarly for xO, yl, zO and xO, yO, zl), xO +^ (yl zl) =^ xO +^ succ~(y +^ z)0 =^ 
(x+~succ^(y+~z))0 =^ succ'^(x+~(y+^z))0 =~ succ~((a;+~y)+~^)0 =~ (x+~y)l+~zl =^ 
(xO +~ yl) zl (and similarly for xl, yO, zl and xl, yl, zO), xl +^ (yl +~ zl) xl +^ 
succ~(y+~z)0 =~ (x+~succ~(y+~z))l succ^'(a;+~(y+~2))l succ^'((a;+~y)+~^)l 
(succ~(x +^ y) +~ z)l =~ succ~(x y)0 +~ zl =~ (xl +~ yl) +~ zl. 

3-6. Similar to the last case, straightforward, case-by-case proofs by Derived Rule 13. 231 □ 

On iterated sums 
Claim 13.2.861 

CScar3(xO, yO, zO) = CScar3(x, y, z) • 
CScar3(xl, yO, zO) = CScar3(xO, yl, zO) = CScar3(xl, yO, zl) = CScar3(x, y, z) • 
CScar3(xO, yl, zl) = CScar3(xl, yO, zl) = CScar3(xl, yl, zO) = CScar3(x, y, z) • 1 

CScar3(xl, yl, zl) = CScar3(x, y, z) • 1 
oCScar3(x, y, z) • = oCSadd3(x, y, z) = • omax3(x, y, z) = • max§(ox, oy, o^) 
• max§(ox,oy,oz) = CScar3(ox, oy, o^) • = CSadd3(ox, oy, o^^) 
oCScar(x, y, z, w) = oCSadd(x, y, z, u;) = 00 • oniax4(x, y, z, if) = 00 • max4(ox, oy, qz, qw) 
00 • max4(ox,oy,o^;,o^^) = CScar(ox, oy, o^, ow^) = CSadd(ox, oy, o^^, ow) 

Proof All can be proved with a very simple application of Derived Rule 13.2.61 generalized 
to three variables, directly from the definitions of the functions involved. □ 

Lemma 13.2.871 

(CScar3(succ''(x), y, z) • 0) -|-^ CSadd3(succ^(x), y, z) 

=^ succ'"'((CScar3(x, y, z) • 0) -I-'' CSadd3(x, y, z)) 
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Proof The proof is a straightforward, if tedious, apphcation of Derived Rule I3.2!3] (gener- 
ahzed to three variables). First, the base cases for e, y, z and x, e, z and x, y, e are proved (each 
one with another apphcation of Derived Rule 13 .231) . and then the eight cases from xO,yO, zO 
to xl,yl,zl are proved from the assumption that the lemma holds for x,y,z. We do not 
show the full proof here as it is not particularly instructive; instead, we give parts of the proof 
for two illustrative cases. First, in the proof of the base case (CScar3(succ^'(e), y, z) • 0) 
CSadd3(succ~(e),y,z) =~ succ~((CScar3(e, y, z) • 0) +~ CSadd3(e, y, z)) by Derived Rule EM 
we show the case for yl, zl. 

(CScar3(succ'^(e), yl, zl) ■ 0) CSadd3(succ'^(e), yl, zl) 
=~ (CScar3(l, yl, zl) ■ 0) +~ CSadd3(l, yl, zl) 
=^ (CScar3(e, y, z) ■ 10) +~ (CSadd3(e, y, z) ■ 1) 
=^ ((CScar3(e, y, z) ■ 1) +~ CSadd3(e, y, z)) ■ 1 
=^ succ^(((CScar3(e, y, z) ■ 1) +~ CSadd3(e, y, z)) ■ O) 
=~ succ^'((CScar3(e, yl, zl) ■ 0) +~ CSadd3(e, yl, zl)) 

Second, in the inductive step, we show the case for xO,yl,2;0. 

(CScar3(succ~(xO), yl, zO) ■ 0) +~ CSadd3(succ^'(2;0), yl, zO) 
=~ (CScar3(xl, yl, zO) ■ 0) +~ CSadd3(a;l, yl, zO) 
(CScar3(x, y, z) ■ 10) +~ (CSadd3(x, y, z) ■ 0) 
=^ (succ^(CScar3(x, y, z) ■ 0) +~ CSadd3(a;, y, z)) ■ 
=~succ~(((CScar3(x,y,z) -0) +~ CSadd3(x, y, z)) • l) 
=~ succ~((CScar3(a;, y, z) ■ 00) +~ (CSadd3(x, y, z) ■ 1)) 
=~ succ~((CScar3(xO, yl, zO) ■ 0) +~ CSadd3(2;0, yl, zO)) □ 

Theorem 13.2.241 CScar(x, y, z, w) +~ CSadd(x, y, z, w) x +~ y +~ z w 

Proof As for the preceding lemma, the proof is a straightforward, if tedious, application 
of Derived Rule [3T2T3] (generalized to four variables). First, the base cases for e,y,z,w and 
X, e, z, w and x, y, e, w and x, y, z, e are proved (each one with another application of Derived 
Rule l3.2T3|) . and then the sixteen cases from xO, yO, zO, to xl, yl, zl, wl are proved from the 
assumption that the lemma holds for x,y,z,w. We do not show the full proof here as it is 
not particularly instructive; instead, we give parts of the proof for one illustrative case. In the 
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inductive step, we show the case for xl,yl, zl,wl. 

CScar(xl, yl, zl, wl) +^ CSadd(xl, yl, zl, wl) 
=^ CScar3(CScar3(x, y, z) ■ 10, CSadd3(a;, y, z) ■ l,wl) ■ 0+^ 

CSadd3(CScar3(x, y, z) ■ 10, CSadd3(x, y, z) ■ 1, wl) 
=^ CScar3(succ'"'(CScar3(x, y, z) ■ 0), CSadd3(a;, y, z),w) ■ 10+^ 

CSadd3(succ~(CScar3(x, y, z) -0), CSadd3(x, y, z)^w) ■ 
=^ succ'"*'^CScar3(succ~(CScar3(x, y, z) • 0), CSadd3(2;, y, z), u;) • +~ 

CSadd3(succ"'^'(CScar3(x, y, z) ■ 0), CSadd3(2;, y,z),iv)^ -0 
=^ succ''(succ"'^'(CScar(x, y, z, ui) +^ CSadd(x, y, z, -w))) -0 
=^ succ~(succ^(x +^ y +~ z +^ u;)) • 
=~ succ~(x y) • +~ succ~(z w) ■ 
=^ xl +~ yl +~ zl +~ wl □ 

Claim 13.2.881 

1. CARADD(ox) =^0 

2. sum(ox) = CAR(ox) +^ ADD(ox) =^ +~ =~ 
Proof 

1. By TIND on x: CARADD(oe) = e =~ 0, CARADD(oi) = 00 0, 

CARADD(ox) = CScar(CAR(ox^),ADD(o2;^),CAR(^ox),ADD(^ox))- 
CSadd(CAR(ox-^), ADD(ox^), CAR(^ox), ADD(^ox)) 

CScar(0, 0, 0, 0) • CSadd(0, 0, 0, 0) 
=~ • =^ 0. 

2. Direct corollary of the first claim. □ 



Theorem 13.2.251 sum(x) — sum(x-^) +'"'sum(^x) 
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Proof 

sum(x) = CScar(CAR(x-<),ADD(2;-<),CAR(^x), ADD(^x))+^ 
CSadd(CAR(x^), ADD(2;^), CAR(^x), ADD(^x)) 
=~ CAR(x^) ADD(x^) +-^CAR(^x) ADD(^x) 
= sum(x-^) sum(^x) □ 

From this theorem, it is possible to prove that sum(x?/) =~ sum(2;) +^sum(y) with a sequence 
of lemmas and theorems similar to the ones used to show that AND(xy) = AND(x) AND(y). 
In particular, we have that sum(xO) sum(x) +~0 sum(x) and sum(xl) sum(x) +''1 
succ-'^'(sum(x)). 

Theorem [3226] sum(x) <''sum(ix) |x| 

Proof By TIND on x: sum(e) = sum(ie) = =~ \e\, sum(i) =^ i <^ 1 =^ sum(l) =^ |1|, 
sum(x) sum{x<) +^sum(^x) sum(ix-^) +~ sum(^ix) =^ sum(ix) =~ sum(ix-^) +~ 
sum(^ix) =^ \ixM\ +^|^ix| =^ |x|. To complete the inductive case for sum(ix) = |x|, we need 
to prove that |x| =^ \x-<\ +^ [►xj by TIND on x: the base cases are trivial, and 

\x<\ +~ |^x| =^ X (lix^l +~ lix^l, lix^l lix^ • 1[) 
=^x?^^ (Iix^l -0,1x^1 +~succ~(k'^|)) 
=~x?^^ (|x^| •0,succ~([x^[+~[x^[)) 
=~x?^^(|x^|-0,|x^|-l) 

=~|xl. □ 

Lemmas for the proof of PHP 

Lemma A.l AND(orf(not^(x),y)) ^^sum(x) <^sum(y) 

Proof By Derived Rule [3X3l 

AND(orf (not«(e), y)) sum(e) sum(y) = AND(y) e <~ sum(y) = 1, 
AND(orf(not«(x),e)) ^^sum(x) <~sum(e) 

= AND(not«(x)) sum(x) =~ e = -^OR(x) x qx = 1, 
AND(orf(not^(xi),yi)) ^^sum(xi) <^sum(yj) 

= AND(orf(not«(x),y)) j) ^^sum(x) +~i <''sum(y) 

= AND(orf(not«(x),y)) A^ V« j) sum(x) <~ sum(y) A'' -^(i = 1. □ 
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Lemma A. 2 AND(orf (not^(x), y)) OR(andf (not«(x), y)) sum(x) <^ sum(y) 

Proof By Derived Rule [3X3) 

AND(orf(not^(e),y)) OR(andf (not^(e), y)) ^«sum(e) <^sum(y) 

= AND(y) A« OR(o2/) e <~ sum(y) 

= O^^e <~sum(y) = 1, 
AND(orf(not«(x),e)) A^ OR(andf (not^(x), e)) ^^sum(2;) <-''sum(e) 

= AND(not^(x)) A^ OR{ox) -^^ sum(x) e 

= O^^O = 1, 

AND(orf(not''(xi),yj)) a'' OR(andf (not''(xi), yj)) ^^sum(xz) <'^sum(yj) 
= (AND(orf(not^(x),y)) A« A«j)) A« (OR(andf (not«(x), y)) V« A«j))-" 
sum(j;) <'^'sum(y) +''^ j 

(and a simple check of all four cases for the possible values of i and j shows that the property 
holds in each one). □ 

Lemma A. 3 maskbit(x, firsti(x)) = OR(x) 

Proof By NIND on x: maskbit(e, firsti(e)) = markbit(e,e) = OR(andf (e, e)) = OR(e), and 
maskbit(xi,firsti(xi)) = maskbit(x, firsti(x)) A^OR(x) ?^ (0, i)) = OR(x) V^OR(x)?^(0,i) = 
OR(x) ?^ (l,i) = OR(a;) V«i = OR(xi). □ 

Lemma A. 4 

(L) Ri^lb(x, y) = maskbit(x, (1 • o(y > xl))<) 
(R) Ri^rb(x, y) = maskbit(3;, (1 • oy)<) 

Proof (L) By Derived Rule [3X3l 

maskbit(e, (1 • o(y l> 1))<) 

= y (maskbit(e, 1), maskbit(e, e)) 

= y?^MO,0) = = ««e = ««lb(e,y), 
maskbit(x, (1 • o{e \> xl))<) 

= maskbit(x, 1 • qx) 

= OR(andf(0x, 1 -oa;)) 

= (0 A^l) V^OR(andf(x,ox)) 

= OR(ox) 

= = Ri'^e = ^''\b{x,e), 
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maskbit(zx, (1 ■ o{yj > ixl))<j 
= maskbit(zx, (1 • o{y > xl))<) 
= y ^maskbit(ix, 1 • qx), 

maskbit(ix,o(ia:; < (1 • o(y l> ' (1 ' o{y > xl))<)^ 

= y (^{i 1) maskbit(x, qx), 

maskbit(ia;, o(ia; < (y t> xl)) • (1 • o(y > xl))<j^ 
= y [i V"" 0, maskbit(ix, Q ■ o{x < {y > xl)) • (1 • o(y > xl))<)^ 
= y (i, {i A" 0) V'' maskbit(x, o(x < (y > xl)) • (1 • o(y > a;l))<)^ 
= y (i, maskbit(x, (1 • o{y > xl))<)^ 
= y?^M^,«"lb(x,2/)) 
= «^lb(ix,2/j)- □ 

Lemma A.5 -«OR(delfirsti(x)) A« lb(x, y) ^» -«OR(lc(x, y<)) 

Proof ByNINDonx: -»OR(delfirsti(£)) A»lb(£,y)^»-»OR(lc(£,y<)) = -»0R(£) A»e^» 
-i^OR(£) = — >^ = 1, If y = £ or y >^ xi, then lb(xi,y) = £, which makes the antecedent of 
— >^ false and the entire statement trivially true. Hence, we prove the inductive step under the 
implicit assumption that y s and y <^ xi. 

^^OR(delfirsti(xi)) A^ lb(xi, y) ^«OR(lc(xi, y<)) 

= y =^ xi A^ -^(OR(x) ?^ (0, i))) A^ -^OR(delfirsti(x)) A^ i -^OR(lc(xz, (xi)<)), 

^"{i A'' -^(OR(x) ?^ (0, i))) A'' -^OR(delfirsti(x)) A'' lb(x, y) -^OR(lc(xz, y<))) 
= y =^ xz (-^(i A^ OR(x) (1, -^i)) A« i A" -«OR(delfirsti(x)) ^« -«OR(lc(xi, x)), l) 

= J/ =^ (-''(OR(x) (i, 0)) A« OR(x) ?^ (i, i) A^ -^OR(delfirsti(x)) -^OR(x), l) 

= y =^ xi (OR(x) 7" (0, i) OR(x) 7" (0, 1), l) 
= y='^xi (1, 1) = 1 □ 
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